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CONTENTS OVERVIEW 


For ease of use for our customers, Integrated Device Technology provides four separate data books: 
High-Performance Logic, Specialized Memories and Modules, RISC and RISC SubSystems, and Static 
RAM. 

IDT's 1992/93 RISC Data Book is comprised of new and revised data sheets for the RISC and RISC 

Subsystem product lines. Also included is a current packaging section for the products included in this 
book. . 
The 1992 RISC Data Book’s Table of Contents is a listing of the products contained in this data book 
only (in the past, we have also included products that appeared in other IDT data books). The numbering 
scheme for the book is consistent with the 1990—91 data books. The number at the bottom center of the 
page denotes the section number and the sequence of the data sheet within that section, (i.e., 5.5 would 
be the fifth data sheet in the fifth section). The number in the lower right-hand corner is the page number 
for that particular data sheet. 

Integrated Device Technology, Inc. is a recognized leader in high-speed CMOS and BiCMOS 
technology and produces a broad line of products. This enables us to provide complete CMOS and 
BiCMOS solutions to designers of high-performance digital systems. Not only do our product lines include 
industry standard devices, they also feature products with faster speeds, lower power, and package and/ 
or architectural benefits that allow designers to significantly improve system performance. 


To find ordering information: Ordering Information for all products in this book appears in Section 
1, along with the Package Marking Description, Product Selector Guide, and Ordering Information. 
Reference data on our Technology Capabilities, Quality Commitments, and Package Diagram Outlines 
is included in Sections 2, 3, and 4, respectively. 


To find product data: Start with the Table of Contents, organized by product line (page 1.2), or with 
the Numeric Table of Contents (page 1.3). These indexes will direct you to the page on which the complete 
technical data sheet can be found. Data sheets may be of the following type: 

ADVANCE INFORMATION—contain initial descriptions (subject to change) for products that are in 
development, including features, block diagrams, and target specifications. 

PRELIMINARY—contain descriptions for products soon to be, or recently, released to production, 
including features, pinouts, and block diagrams. Timing data are based on simulation or initial character- 
ization and are subject to change upon full characterization. 

FINAL—contain minimum and maximum limits specified over the complete voltage supply and 
temperature range for full production devices. 


New products, product performance enhancements, additional package types, and new product 
families are being introduced frequently. Please contact your local IDT sales representative to determine 
the latest device specifications, package types, and product availability. 


ABOUT THE COVER 


The cover shows IDT’s family of RISController™ products, which includes the R3041™ ,R3051™ , and 
R3081™. These R3000-derivative microprocessors are pin- and software-compatible, and have been 
designed to address specific price/performance targets for embedded applications. The background 
shows an R3081 wafer at approximately 1.2x magnification. 

IDT also offers a variety of RISC-based board-level solutions, evaluation tools, and development 
support. IDT’s success in the proliferation of application-optimized. microprocessors is the result of 
blending the MIPS high-performance RISC architecture with our state-of-the-art process technology and 
wide-ranging system expertise. 
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LIFE SUPPORT POLICY 


Integrated Device Technology's products are not authorized for use as critical components in life support devices or systems 

unless a specific written agreement pertaining to such intended use is executed between the manufacturer and an officer of IDT. 

1. Life support devices or systems are devices or systems which (a) are intended for surgical implant into the body or (b) support 
or sustain life and whose failure to perform, when properly used in accordance with instructions for use provided in the 
labeling, can be reasonably expected to result in a significant injury to the user. 

2. Acriticalcomponentis any component ofa life support device or system whose failure to perform can be reasonably expected 
to cause the failure of the life support device or system, or to affect its safety or effectiveness. 


Note: Integrated Device Technology, Inc. reserves the right to make changes to its products or specifications at any time, without notice, in order to improve 
design or performance and to supply the best possible product. IDT does notassume any responsibility for use of any circuitry described other than the circuitry 
embodied in an IDT product. The Company makes no representations that circuitry described herein is free from patent infringement or other rights of third 
parties which may result from its use. No license is granted by implication or otherwise under any patent, patent rights or other rights, of Integrated Device 
Technology, Inc. a | 


The IDT logo is a registered trademark, and BUSMUX, Flexi-pak, BICEMOS, CacheRAM, CEMOS, FASTX, Flow-thruEDC, IDT/c, IDT/envy, IDT/kit, DT/sae, 
IDT/sim, IDT/ux, MacStation, REAL8, RISC SubSystem, RISChipset, RISController, RISCore, SmartLogic, SyncFIFO, eysiomeantores, TargetSystem, 
IDT79R3041, IDT79R3051, and IDT79R3081 are trademarks of Integrated Device Technology, Inc. 

Adobe, the Adobe logo, PostScript, PixelBurst, and the PostScript logo are trademarks of Adobe Systems Incorporated which may be registered in certain 
jurisdictions. All other trademarks are trademarks of their respective companies. 
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ORDERING INFORMATION 


When ordering by TWX or Telex, the following format must be used: 
Complete Bill To. 
Complete Ship To. 
Purchase Order Number. 
Certificate of Conformance. Y or N. 
Customer Source Inspection. Y or N. 
Government Source Inspection. Y or N 
Government Contract Number and Rating. 
Requested Routing. 
IDT Part Number — 


~ITOMNMMOOD>Y 


Each item ordered must use the complete part number exactly as s listed in the price book. 


SCD Number — Specification Control Document (Internal Mavelet). 
Customer Part Number/Drawing Number/Revision Level ~ 


AS 


customer specification is required. 


Request Date With Exact Quantity. 
Unit Price. 
Special Instructions, Including Q.A. Clanas Special Processing. 


OozZ=Er 


Federal Supply Code Number/Cage Number — 61772. 
Dun & Bradstreet Number — 03-814-2600 

Federal Tax |.D. — 94-2669985 

TLX# — 887766 

FAX# — 408-727-3468 


PART NUMBER pee 


N= Numeric Character . . a 
999 A _A _ A 
- PROCESS/ SPECIAL — 

TEMPERATURE PROCESS. 


i 


~ BLANK 
M* 
B 


A= Alpha Character 
IDT XXXXX A X 
_ REVISION SPEED ‘PACKAGE 


DEVICE TYPE POWER 





SPEED 


A 
Blank 


TYPE*™* 


PACKAGE DESCRIPTION TABLE 


CERAMIC SIDEBRAZE 

CERDIP 

FLATPACK 

PIN GRID ARRAY 

PLASTIC LEADED CHIP CARRIER 
LEADLESS CHIP CARRIER 
PLASTIC DIP 

SOJ 


Cc 
D 
F 
G 
J 

L 

Pp 
v. 


*Consult Factory 
**For Logic, the "54" series (e.g. IDT54FCT138) — —55°C to +125°C 
the "74" series (e.g. IDT74FCT138) — 0°C to +70°C 


1.4 


POWER 
DEVICE 


Specify whether part number is for reference only, mark only, or if extended processing to 


Customer General Specification Numbers/Other Referenced Drawing Numbers/Revision Levels. 


RADIATION TOLERANT 


COMMERCIAL — 0°C to +70°C 


COMMERCIAL — —55°C to +125°C 
MILITARY — —55°C to +125°C cy compliant 
to MIL-STD-883, Method 5004, Class B) 


SEE PACKAGE DESCRIPTION TABLE 


GUARANTEED MINIMUM PERFORMANCE 
MEASURED IN NANOSECONDS OR MHz 


S — STANDARD POWER 
L — LOW POWER 


e.g. 6116 


PLASTIC FLATPACK 

PLASTIC SMALL OUTLINE IC 
SIDEBRAZE THINDIP (300 MIL) 
PLASTIC THIN DUAL IN-LINE 
CERQUAD GULL WING 
CERPACK (F11 CONFIG. ONLY) 
FINE-PITCH LCC 








IDT PACKAGE MARKING DESCRIPTION 


PART NUMBER DESCRIPTION 


IDT’s part number identifies the basic product, speed, 
power, package(s) available, operating temperature and 
processing grade. Each data sheet has a detailed description, 
using the part number, for ordering the proper product for the 
user’s application. The part number is comprised of a series 
of alpha-numeric characters: 


1. An “IDT” corporate identifier for Integrated Device 
Technology, Inc. 

2. A basic device part number composed of alpha-numeric 
characters. 

3. A device power identifier, composed of one or two alpha 
characters, is used to identify the power options. In most 
cases, the following alpha characters are used: 

“S” or “SA” is used for the standard power product. 
“L” or “LA” is used for lower power than the standard 
power product. 


Example for Monolithic Devices: 


IDT XXX...XXXK XX 


* Field Identifier Applicable To All Products 


ASSEMBLY LOCATION DESIGNATOR 

IDT uses various. locations for assembly. These are 
identified by an alpha character in the last letter of the date 
code marked on the package. Presently, the assembly 
location alpha character is as follows: 


A = Anam, Korea 
| = USA 
P = Penang, Malaysia 


4. Adevice speed identifier, when applicable, is either alpha 
characters, such as “A” or “B”, or numbers, such as 20 or 
45. The speed units, depending on the product, are in 
nanoseconds or megahertz. 

5. Apackage identifier, composed of one or two characters. 
The data sheet should be consulted to determine the 
packages available and the package identifiers for that 
particular product. | 

6. Atemperature/process identifier. The productis available 
in either the commercial or military temperature range, 
processed to a commercial specification, or the productis 
available in the military temperature range with full 
compliance to MIL-STD-883. Many of IDT’s products 
have burn-in included as part of the standard commercial 
process flow. 

7. Aspecialprocess identifier, composed of alphacharacters, 
is used for products which require radiation enhancement 
(RE) or radiation tolerance (RT). 





Special Process 
Process/Temperature” 
Package” 

Speed 

Power 


Device Type* 


2507 drw 01 


MIL-STD-883C COMPLIANT DESIGNATOR 


IDT ships military products which are compliantto the latest 


revision of MIL-STD-883C. Such products are identified by a 
“C” designation onthe package. The location of this designator 
is specified by internal documentation at IDT. 





HIGH-SPEED CMOS | 
MICROPROCESSOR FAMILY 


PRODUCT SELECTOR GUIDE 


Integrated Device Technology, Inc.. 








Broadest range of high-performance to low-cost, code- ia Buffer, 73720 Bus Exchanger, R3721 DRAM Controller — 





compatible RISC processors: R3000A, R4000 CPUs, R3001, * Applications range from real-time control to multiprocessing 
R3041, R3051/52, R3081 RISControllers™ , R3010A FPA, systems 
-R3500 RISCore™ ¢ Optimizing compilers for C, Pascal, FORTRAN, Ada, PL/1 afd: 
* R4000—third-generation high- -performance 64-bit CPU and Cobol 
FPA with on-chip cache ~ ¢ R3000, R3001, R3041, R3051/52, R3081, and R3500 are 
* R3001, R3041, R3051/52, and R3081 RiSController Family— 100% code-compatible 
designed for lower cost embedded systems, all code- ome * R3010A Floating Point Accelerator—conforms with IEEE 754 
ible with original R3000 1985 Standard 
* R3500 RISCore—combines CPU and FPA, pin- Bnd Sw. * R3020 Write Buffer enhances CPU performance by allowing 
“compatible with the R3000A memory "write-through” during run cycles | 
- Support chips designed for RISC systems: R3020 Write * Low-cost Evaluation Boards available — 
oo . Data — 
os fea. : | 7 se 7 Book 
Part Number Description | | Pkgs. Avail. Page 
RISC CMOS MICROPROCESSORS } 
IDT79R4000, Very high-performance, highly integrated 64-bit CPU, 447PGA NOW 5.7 
79R4000A fully binary compatible with the R3000A. Combines 179PGA 


CPU, floating-point and 16KB of cache (32KB for R4000A) 
capable of over 50 VAX mips sustained performance 


IDT79R3000A RISC CPU Processor, 20-40MHz, on-chip Cache . 144PGA NOW 5.1 


Control, Memory Management Unit, 64-Entry Translation 175PGA 
Lookaside Buffer, Thirty-two 32-bit General Purpose 172FP 
Registers 160MQUAD 
IDT79R3001 RISController, aenwalive of the R3000 designed for 144PGA - NOW 5.2 
lower cost embedded systems. Achieves high 172FP 


performance with reduced memory parts count, lower 
overall system cost, and includes real-time features 


IDT79R3041 RISController, R3000A core, 4- ates read/write buffers, — ~ 84PLCC 1Q'93 5.4 
2.5KB on-chip cache 
IDT79R3051/52 RlSControllers, 6kB or 10kB on- rehip cache, R3000A 84PLCC NOW 55 


CPU core, and 4-deep read/write buffers, low-cost 
84-pin plastic packaging 


IDT79R3081™ RlSController, 20kKB on-chip cache, R3000 CPU core, 84MQUAD NOW 5.6 
R3010A Floating Point Accelerator, 4-deep read/write 84PGA 
; _ buffers, pin-compatable with 3051/3052 : . rd - 
IDT79R3500 RISCore integrates R83000A CPU and R3010A FPA using i 161PGA NOW . 53 


the R3000A pinout: Up to 32 VUPS sustained 
performance at 40MHz 


IDT79R3010A RISC Floating-Point Accelerator, 20-40MHz 84PGA NOW 6.1 











84FP 
RISC SUPPORT DEVICES | 
IDT73720 16-Bit Tri-Port Bus Exchanger 68PLCC NOW 6.5 
80PQFP 
IDT79R3721 DRAM Controller, Interfaces to R3051/52 84PLCC NOW 6.4 
IDT79R3020 RISC CPU Write Buffer 68PLCC NOW 6.3 
IDT79R3730 integrated System Controller 208PQFP 1Q'93 6.6 








High-Speed CMOS RISC Microprocessor Family (Cont'd) 





Data 
Book 

Part Number Description Pkgs. Avail. Page 
RISC DEVELOPMENT SYSTEMS | 
IDT7RS503 R3000 NuBus™ add-in card for Macintosh® |I, includes NOW 7.11 

RISC/os and C Compiler, code development and 

debugging environment 
RC3230 Magnum series desktop development host. Entry-level NOW 

system for 15-20 users, rated at 17.8 SPECmarks with 

25MHz R3000 CPU. 33MHz model now available. 
RC3240 Desk-side development host. For medium size projects, NOW . 

AT expansion slots, R3000-25 for 16SPECmarks performance 
RC3260 Pedestal development host. Higher-end multi-user. NOW 
R3000 FAMILY EVALUATION TOOLS (See RISC SubSystems) 
IDT7RS382/383/385_—-R3000 Family Evaluation Boards NOW Tal 
R3000 FAMILY MIPS SOFTWARE 
79SFOR-2n-RTU FORTRAN RISCompiler™ NOW 
79SPAC-2n-RTU Pascal RISCompiler NOW 
79SANC-2n-RTU ANS! C RISCompiler NOW 
79S5053-n AT&T C++ translator NOW 
79SSPP-5-SRC1 SPP (System Programmer's Package) for R3000 NOW 
79SSPE-5-SRC1 SPP/e for embedded systems. Excludes simulation NOW 

programs Cache 2000 and SABLE for R3000 
79SRWN-3n-RTU RISCwindows™ operation environment NOW 
79SRCM-2n-RTU DECnet™ communication software NOW 
79SSPP41BUI SPP (System Programmer's Package) for R4000 Binary NOW 
79SSPE41BUI SPP/e for R4000 Binary NOW 
79SSPS41Bul Sable simulator for R4000 Binary . | NOW 
79SSPC41 BUI Cache simulator for R4000 Binary NOW 
79SSP55BUI Sable Simulator for R3000 Binary NOW 
79SSPC5BU! Cache Simulator for R3000 Binary | NOW 


NOTE: All development systems (MacStation and MIPS) come standard with RISC/os (UNIX®) and C-Compiler software. Additional memory, disk peripherals, tape 
peripherals and interface options are available from IDT for MIPS development systems. 


Integrated RISC Design Solutions 


IDT is committed to providing complete integrated RISC solutionsby The MacStation™ R3000 development board that is hosted in the 
combining expertise in silicon process technology with leadership Macintosh II (P/N IDT7RS503) comes complete with IDT/ux™ (UNIX 
products in development systems and software. Long an industry operation system), the language compiler and software debugging 
leader in producing the fastest static RAMs for cache memory and__ tools. IDT complementary components and modules include: 
high-speed logic for memory interface, IDT offers: : 
Cache Memories 


Part No. Description Frequency Access Time 


Dedicated RISC support chips 


¢ MIPS compilers and cross-software for PC and Sun | 
IDT71586 4k x 16 (w/Latch) 25MHz 25ns 
¢ CPU and cache modules 
; ' IDT7164 8k x 8 33MHz 12ns 
¢ RISC evaluation and prototyping vehicles 
bs <3 IDT7198 16k x 4 33MHz 12ns 
¢ Monitors and debuggers = 
; ba IDT61298 64k x 4 (W/OE) 33MHz 12ns 
ee ee IDT6198 16k x4 33MH 12 
¢ MIPS development hosts and | ‘ i ‘i ” 
IDT71B229 16kx9x2 40MHz 12ns 


Macintosh development system 
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High- epee CMOS RISC Microprocessor Family (Cont'd) 


Bus intertece Logic 

IDT74ECT373A/C Octal. Latch 
IDT74FCT374A/C Octal Register 
IDT74FCT240A/C Octal Buffer 
IDT74FCT244A/C Octal Buffer 
IDT29FCT520A/B Multilevel Pipeline Register 
IDT49FCT804A Tri-Port Bus Multiplexer 


Peripheral Components 
73200 Read-write Buffer | 
73201 Read-write Buffer 
73210 Read-write Buffer 
73211 Read-write Buffer 


-Standard Versions: 


~ IDT7MB6064 __ 


Cache Modules 


Dual’ 64kB 


“< IDT7MB6139_ 
IDT7MB6043 Dual 32kB 
IDT7MB6044 Dual 16kB. 
-- IDT7MP6074 R4000 Secondary Cache Modules 
_- Multiprocessing: j 
IDT7MB6049 - Dual 64kB. 
_ IDT7MB6051 Dual 32kB 
_ IDT7MB6061. - Dual 64kB w/Resettable |-Cache 
Dual Resettable 16kB | 


Reduce Your Development Time with RISC  SubSystems, 


FASTER SYSTEMS: FASTER DESIGN CYCLES 


Using RISC technology, you can build systems that will run rings 
around an old 386 or 680x0 design. IDT's RISC SubSystems” — 


Division can help you get your design completed in record time. 
IDT has proven RISC design and manufacturing experience that 
you can rely on. Exploit our expertise by having IDT design and 
manufacture your board. Or integrate one:of our pre-built, fully- 
tested modules into your design. IDT also offers a full range of 
development support including prototyping hardware, software 
tools, and the powerful MacStation 3 development system. 


CUSTOM DESIGN AND MANUFACTURING 

IDT has successfully designed and produced over thirty boards 
and modules based on the MIPS RISC architecture. We have 
become experts in quickly bringing.quality products to market. The 
IDT advantage has five components: (1) extensive hardware 


design experience, (2) internal: surface mount manufacturing . 


capability, (3) complete suite of diagnostic tests, (4)experience in 
hardware/software integration, (5) a detailed understanding of 
component characteristics. Take advantage of this experience 


by allowing us todesign and manufacture your board. Orworkwith -. 


us in a joint development program where we handle the CPU 
interface so you can concentrate on product differentiation. 


MODULES 


Our modules contain the RISC CPU, Floating Point Accelerator, 


and all the cache memory: Most include clock control, interrupt 
and initialization logic, and read and write buffers, as well. Allthe 
components are surface-mounted on small, plug-in PC boards, 
burned-in and tested at the rated speed. All the tricky timing, and 


~ high- speed Aeaau is done and tested for you. The modules canbe 


plugged into motherboards containing main memory, I/O, and the 
rest of the system, all of which is relatively low speed and is — to 
lay out using conventional design techniques. 


PROTOTYPING PLATFORMS | 


To shorten your design time even more, we offer Prototyping 
Platforms for the modules. The Prototyping Platform contains main 
memory, serial /O, a powerful.debug monitor in EPROM, and a 


personality card that interfaces it directly to one of our modules. You 


can download your software onto the Prototyping Platform, and/or 
design additional hardware and plug it in. 


MACSTATION 3—DEVELOPMENT SYSTEM 

IDT offers a complete R3000 development system in a Macintosh || 
computer. Click aniconon your Mac and a new window opens under 
MultiFinder with the UNIX operating system in it. The UNIX code is 
actually running on a fast R3000 system board inside the Mac, and 
you can run all the MIPS and IDT development tools, including the 
C compiler and the System Programmers’ Package. The MacStation 
3 is available in 15mips and 25mips versions. 


SOFTWARE 

IDT's RISC SubSystems Divi sion offers R3000 software that makes 
software development faster and easier. You can use our System 
Integration Manager to control prototypes, to debug software, and to 
manage I/O drivers. You can put our Monitor into your hardware to 
control the basic system startup. You can use the Kernel Integration 
Toolkit as a building block to quickly develop software. And our 
cross-software for PCs, MIPS and Sun work-stations makes efficient 
C-program compilation possible with readily. available equipment. 





RISC Subsystems (Cont'd) 


Data 
Book 
Model Description Avail. Page 
R3000-BASED CPU MODULES AND PROTOTYPING HARDWARE 
IDT7RS109 64KB each of I- and D-cache. Supports Multiprocessor. On-board NOW 8.1 
parity, clock, reset, interrupt control. Speeds up to 33MHz. 
IDT7RS110 32KB each of I- and D-cache. Single-word Read and Write buffers. NOW 8.2 
Small size ideal for embedded applications. Speeds up to 33MHz. 
IDT7RS409 Prototyping Systems including a 33MHz 7RS109 module. Each NOW 


prototyping system includes 1MB of static RAM, Counter/Timer, 
IDT/sim™ debug monitor in EPROM, serial and parallel !/O ports. 
Quick way to begin development of R3000 hardware and software. 


IDT7RS410 Prototyping Systems including a 33MHz 7RS110 module. Each NOW 
prototyping system includes 1MB of static RAM, Counter/Timer, 


IDT/sim debug monitor in EPROM, serial and parallel I/O ports. 

EVALUATION BOARDS 

IDT79S385A R3051 Family Evaluation Board. Complete, self-contained system NOW 7.7 
requiring only a power supply and simple terminal to be operational. 
Contains R3052E CPU, 1MB of DRAM, !DT/sim monitor in EPROM, 


serial I/O ports, C compiler, R308 1 sample. Supplied with all schematics and user's manual. 
IDT79S389 R3051 Family Laser Printer Controller Reference Platform for NOW 7.5 


PostScript Level 2 Software from Adobe. 
SOFTWARE DEVELOPMENT SYSTEMS 
IDT7RS503 MacStation 3 Development System. Complete R3000 CPU on NOW 7.11 
NuBus card that plugs into a Mac II. Runs IDT/ux™ (UNIX SVR38) in a 
window under MultiFinder. Available in 15mips and 25mips versions. 
Supplied with MIPS C/ compiler. Other MIPS software products are 
available for the MacStation 3. 
R3000 HARDWARE AND SOFTWARE DEVELOPMENT TOOLS 


IDT7RS901 IDT/sim System Integration Manager. Powerful, flexible debug mon- NOW 7.8 
itor for R3000-, R3001- and R305x-based systems. Includes trace, 
single-step, cache control, many more functions. Easily extensible. 
Includes support for source-level debug. Available in source code. 


IDT7RS903 IDT/c. Multihost optimizing C compiler, available for operation on NOW 7.9 
386/486 machines (MS-DOS or UNIX), for MIPS or MacStation 
under RISC/os, and for Sun SparcStation. Generates efficient R3000 
code. Includes floating point libraries for efficient operation without an FPU. 
IDT7RS909 IDT/kit. Kernal Integration Toolkit. Consists of a set of modules ready NOW 7.10 
to be linked with user-developed code to provide an operating system 
kernal for R3000-based embedded systems. Functions are provided for 
initializing systems, memory management, handling interrupts, servicing 
floating point exceptions and time support. Libraries are included for floating 
point emulation, transcendental math routines, and ANSI standard C 
functions. It also contains IDT's Micromonitor, a very simple monitor for 
initial debug of new hardware. The Micromonitor requires only that the CPU, 
RAM, and a serial port be operational. All code is supplied in source code (C 
and assembly), to allow easy access for any modifications needed to tailor 
the system to the specific application. Code is also supplied in standard archive 
library format (compiled with IDT/c and MIPS tool chains) for Big and Little 
Endian targets and for hardware floating point and software FP emulation. 
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IDT...LEADING THE CMOS FUTURE 


A major revolution is taking place in the semiconductor 
industry today. A new technology is rapidly displacing older 
NMOS and bipolar technologies as the workhorse of the '80s 
andbeyond. Thattechnology is high-speed CMOS. Integrated 
Device Technology, a company totally predicated on and 
dedicated to implementing high-performance CMOS products, 
is on the leading edge of this dramatic change. 

Beginning with the introduction of the industry’s fastest 
CMOS 2K x 8 static RAM, IDT has grown into a company with 
multiple divisions producing a wide range of high-speed 
CMOS and BiCMOS circuits that are, inalmost every case, the 
fastest available. These advanced products are produced 
with IDT’s proprietary technology, a twin-well, dry-etched, 
stepper-aligned process utilizing progressively smaller 
dimensions. 

From inception, IDT’s product strategy has been to apply 


the advantages of its extremely fast CMOS technology to 


produce the integrated circuit elements required to implement 
high-performance digital systems. IDT’s goalis to provide the 
circuits necessary to create systems which are far superior to 
previous generations in performance, reliability, cost, weight, 
and size. Many of the company’s innovative product designs 
offer higher levels of integration, advanced architectures, 
higher density packaging and system enhancement features 
that are establishing tomorrow's industry standards. The 
company is committed to providing its customers with an ever- 
expanding series of these high-speed, lower-power IC solutions 
to system design needs. 

IDT's commitment, however, extends beyond state-of-the- 
arttechnology and advanced products to providing the highest 
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level of customer service and satisfaction in the industry. 
Manufacturing products to exacting quality standards that 
provide excellent, long-term reliability is given the same level 
of importance and priority as device performance. IDT is also 
dedicated to delivering these high-quality advanced products 
on time. The company would like to be known not only for its 
technological capabilities, but also for providing its customers 
with quick, responsive, and courteous service. 

IDT’s product families are available in both commercial and 
military grades. As abonus, commercial customers obtain the 
benefits of military processing disciplines, established to meet 
or exceed the stringent criteria of the applicable military 
specifications. 

IDT is a leading U.S. supplier of high-speed CMOS and 
BiCMOS circuits. The company’s high-performance fast 
SRAM , FCT logic, high-density modules, FIFOs, multi-port 
memories, BiCMOS ECL I/O memories, RISC SubSystems, 
and the 32- and 64-bit RISC microprocessor families 
complement each other to provide high-speed CMOS and 
BiCMOS solutions for awide range of applications and systems. 

Dedicated to maintaining its leadership position as a state- 
of-the-art IC manufacturer, IDT will continue to focus on 
maintaining its technology edge as well as developing a 
broader range of innovative products. New products and 
speed enhancements are continuously being added to each 
of the existing product families, and additional product families 
are being introduced. Contact your IDT field representative or 
factory marketing engineer for information on the most current 
product offerings. Ifyou’re building state-of-the-art equipment, 
IDT wants to help you solve your design problems. 








IDT MILITARY AND DESC-SMD PROGRAM 


IDT is a leading supplier of military, high-speed CMOS 
circuits. The company’s high-performance Static RAMs, FCT 
Logic Family, Complex Logic (CLP), FIFOs, Specialty 
Memories (SMP), ECL I/O BiCMOS Memories, 32-bit RISC 
Microprocessor, RISC Subsystems and high-density 
Subsystems Modules product lines complement each other to 
provide high-speed CMOS solutions to a wide range of 
military applications and systems. Most of these product lines 
offer Class B products which are fully compliant to the latest 
revision of MIL-STD-883, Paragraph 1.2.1. In addition, IDT 
offers Radiation Tolerant (RT), as well as Radiation Enhanced 
(RE), products. ao | 

IDT has an active program with the Defense Electronic 
Supply Center (DESC) to list all of IDT’s military compliant 


devices on Standard Military Drawings (SMD). The SMD 
program allows standardization of militarized products and 
reduction of the proliferation of non-standard source control 
drawings. This program will go far toward reducing the need 
for each defense contractor to make separate specification 
control drawings for purchased parts. IDT plans to have 
SMDs for many of its product offerings. Presently, IDT has 88 
devices which are listed or pending listing. The devices are 
from IDT’s SRAM, FCT Logic family, Complex Logic (CLP), 
FIFOs and Specialty Memories (SMP) product families. IDT 
expects to add another 20 devices to the SMD program in the 
near future. Users should contact either IDT or DESC for 
current status of products in the SMD program. 
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RADIATION HARDENED TECHNOLOGY 


{DT manufactures and supplies radiation hardened. products 
for military/aerospace applications. Utilizing special processing 
and starting materials, IDT’s radiation hardened devices survive 
in hostile radiation environments. !n Total Dose, Dose Rate, 
and environments where single event upset is of concern, IDT 
products are designed to continue functioning without loss of 
performance. IDT can supply all its products on these 
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processes. Total Dose radiation testing is performed in-house 
on an ARACOR X-Ray system. External facilities are utilized 
for device research on gamma cell, LINAC and other radiation 
equipment. IDT has an on-going research and development 
program for improving radiation handling capabilities (See 
“IDT Radiation Tolerant/Enhanced Products for Radiation 
Environments” in Section 3) of IDT products/processes. 








IDT LEADING EDGE CMOS TECHNOLOGY 


HIGH-PERFORMANCE CMOS | 

From IDT’s beginnings i in 1980, it has had a belief in eed: a 
commitment to CMOS. The company developed a high- 
performance version of CMOS that allows the design and 


_ manufacture of leading-edge components. Itincorporates the 


best characteristics of traditional CMOS, including low power, 
high noise immunity and wide operating temperature range; it 
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CMOS IV = CMOS Ill — scaled process optimized for high-speed logic. 


also’ achieves speed and output drive equal or superior to 
bipolar Schottky TTL. The last decade has seen development 
and production of four “generations” of IDT’s CMOS technology 
with process improvements which have reduced IDT’s electrical 
effective (Leff) gate lengths by more than 60 percent from 1.3 
microns (millionths of a meter) in 1981 to0.45 microns in 1990. 
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Figure 1. 


Continual advancement of CMOS technology allows IDT to 
implement progressively higher levels of integration and 
achieve increasingly faster speeds maintaining the company’s 
established position as the leader in high-speed CMOS 


' integrated circuits. In addition, the fundamental process 


technology has been extended to add bipolar elements to the 
CMOS platform. IDT’s BICMOS process combines the ultra- 
high speeds of bipolar devices with the lower power and cost 
of CMOS, allowing us to build even faster components than 
straight CMOS at a slightly higher cost. 
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Figure 2. Fifteen-Hundred-Power Magnification Scanning Electron 
Microscope (SEM) Photos of the Five Generations of IDT's CMOS 
Technology 
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Figure 3. IDT CMOS Device Cross Section 


ALPHA PARTICLES 

Random alpha particles can cause memory cells to 
temporarily lose their contents or suffer a “soft error.” Traveling 
with high energy levels, alpha particles penetrate deep into an 
integrated chip. As they burrow into the silicon, they leave a 
trail of free electron-hole pairs in their wake. 

The cause of alpha particles is well documented and 
understood in the industry. IDT has considered various 
techniques to protect the cells from this hazardous occurrence. 
These techniques include dual-well structures (Figures 3 and 
4) and a polymeric compound for die coating. Presently, a 
polymeric compoundis used in many of IDT’s SRAMs; however, 
the specific techniques used may vary and change from one 
device generation to the next as the industry and IDT improve 
the alpha particle protection technology. 


LATCHUP IMMUNITY 


A combination of careful design layout, selective use of 
guard rings and proprietary techniques have resulted in virtual 
elimination of latchup problems often associated with older 
CMOS processes (Figure 5). The use of NPN and N-channel 
I/O devices eliminates hole injection latchup. Double guard 
ring structures are utilized on all input and output circuits to 
absorb injected electrons. These effectively cut off the current 
paths into the internal circuits to essentially isolate I/O circuits. 
Compared to older CMOS processes which exhibit latchup 
characteristics with trigger currents from 10-20mA, IDT 
products inhibit latchup at trigger currents substantially greater 
than this. 
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Figure 4. IDT CMOS Built-In High Alpha Particle Immunity 
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Figure 5. IDT CMOS Latchup Suppression 
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SURFACE MOUNT TECHNOLOGY 
AND 
IDT’S eee PRODUCTS 


Requirements for circuit area reduction, utilizing the most 
efficient and compact component placement possible and the 


needs of production manufacturing for electronics assemblies 


are the driving forces behind the advancement of circuit-board 
assembly technologies. These needs are closely associated 
with the advances being made in surface mount devices 
(SMD) and surface mount technology (SMT) itself. Yet, there 
are two major issues with SMT in production manufacturing of 
electronic assemblies: high capital expenditures and com- 
plexity of testing. 

The capital expenditure required to convert to efficient 
production using SMT is still too high for the majority of 
electronics companies, regardless of the 20-60% increase in 
the board densities which SMT canbring. Because of this high 
barrier to entry, we will continue to see a large market segment 
[large even compared to the exploding SMT market] using 
traditional through-hole packages (i.e. DIPs, PGAs, etc) and 
assembly techniques. How can these types of companies 
take advantage of SMD and SMT? Let someone else, such 
as IDT, doit for them by investing time and money in SMT and 
then in return offer through-hole products utilizing SMT proc- 
esses. Products which fit this description are multi-chip 
modules, consisting of SMT assembled SMDs on a through- 
hole type substrate. Modules enable companies to enjoy SMT 
density advantages and traditional package options without 
the expensive startup costs required to do SMT in-house. 

Although subcontracting this type of work to an assembly 
house is an alternative, there still is the other issue of testing, 
an area where many contract assembly operations fall short 
of IDT’s capability and experience. Prerequisites for ade- 
quate module testing sophisticated high-performance par- 
ametric testers, customized test fixtures, and most impor- 
tantly the experience to tests today’s complex electronic 
devices. Companies can therefore take advantage of IDT’s 
experience in testing and manufacturing high-performance 
CMOS multi-chip modules. 

At IDT, SMD components are electrically tested, environ- 
mentally screened, and performance selected for each IDT 
module. All modules are 100% tested as if they are aseparate 
functional component and are guaranteed to meet all speci- 
fied parameters at the module output without the customer 
having to understand the modules’ internal workings. 


Other added benefits companies get by using) IDT’ s CMOS 
module products are: 

1) awide variety of high-performance, fhroual: -hole prod- 
ucts utilizing SMD packaged components, 

2) fast speeds compared with NMOS based products, 

3) low power consumption compared with bipolar tech- 
nologies, and 

4) lowcost manrigeien) colnpalee with GaAs- based 
products. 


IDT has recognized the SObiene of SMT and began 
offering CMOS modules as part of its standard product port- 
folio. IDT modules combine the advantages of: 

1) the low power characteristics = ue Ss “yee and 

BiCMOS products, 

2) thedensity advantages of first class SMD sanoneiis 

including those from IDT’s components divisions, and 

3) experience insystem level design, manufacturing, and 

testing with its own in-house SMT operation. 


IDT currently has two divisions (Subsystems and RISC 
Subsystems) dedicated to the development of module prod- 
ucts ranging from simple memory modules to complex VME 
sized application specific modules to full system-level CPU 
boards. These modules have surface mount devices as- 
sembled on both sides of either a multi-layer glass filled epoxy 
(FR-4) or a multi-layer co-fired ceramic substrate. Assembled 
modules come available in industry standard through-hole 
packages and other space-saving module packages. Indus- 
try proven vapor-phase or IR reflow techniques are used to 
solder the SMDs to the substrate during the assembly proc- 
ess. Because of our affiliation with IDT’s experienced semi- 
conductor manufacturing divisions, we thoroughly under- 
stand and therefore test all modules to the applicable da- 
tasheet specifications and customer requirements. . 

Thus, IDT is able to offer today's electronic design engi- 
neers a unique solution for their “need-more-for-less” 
problem.modules. These high speed, high performance 
products offer the density advantages of SMD and SMT, the 
added benefit of low power CMOS technology, and through- 
hole packaged electronics: mined the high cost of cong it in- 
house. : 





STATE-OF-THE-ART FACILITIES AND CAPABILITIES 


‘Integrated Device Technology is headquartered in Santa 
Clara, California—the heart of “Silicon Valley.” The company’s 
operations are housed in six facilities totaling over 500,000 
square feet. These facilities house all aspects of business 
from research and development to design, wafer fabrication, 
assembly, environmental screening, test, and administration. 
In-house capabilities include scanning electron microscope 
(SEM) evaluation, particle impact noise detection (PIND), 
plastic and hermetic packaging, military and commercial 
testing, burn-in, life test, anda full complement of environmental 
screening equipment. | 

The over-200,000-square-foot corporate headquarters 
campus is composed of three buildings. The largest facility on 
this site is a 100,000 square foot, two-building complex. The 
first building, a 60,000-square-foot facility, is dedicated to the 
Standard Logic and RISC Microprocessor product lines, as 
wellas hermetic and plastic package assembly, logic products’ 
test, burn-in, mark, QA, and a reliability/failure analysis lab. 

IDT’s Packaging and Assembly Process Development 
teams are located here. To keep pace with the development 
of new products and to enhance the IDT philosophy of 
“innovation,” these teams have ultra-modern, integrated and 
correspondingly sophisticated equipment and environments 
at their disposal. All manufacturing is completed in dedicated 
clean room areas (Class 10K minimum), with all preseal 
operations accomplished under Class 100 laminar flow hoods. 

Development of assembly materials, processes and 
equipment is accomplished under a fully operational production 
environment to ensure reliability and repeatable product. The 
Hermetic Manufacturing and Process Development team is 
currently producing custom products to the strict requirements 
of MIL-STD-883. The fully automated plastic facility is currently 
producing high volumes of USA-manufactured product, while 
developing state-of-the-art surface-mount technology 
patterned after MIL-STD-883. 

Thesecond building of the complex houses sales, marketing, 
finance, MIS, and Northwest Area Sales. 

The RISC Subsystems Division is located across from the 
two-building complex in a 50,000-square-foot facility. Also 
located at this facility are Quality Assurance and wafer 
fabrication services. Administrative services, Human 
Resources, International Planning, Shipping and Receiving 
departments are also housed in this facility. 
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IDT’s largest and newest facility, opened in 1990 in San 
Jose, California, is a multi-purpose 150,000-square-foot, ultra- 
modern technology development center. This facility houses 
a 25,000 square foot, combined Class 1 (a maximum of one 
particle-per-cubic-foot of 0.2 micron or larger), sub-half-micron 
R&D fabrication facility and a wafer fabrication area. This fab 
supports both production volumes of IDT products, including 
some next-generation SRAMs, and the R&D efforts of the 
technology development staff. Technology development efforts 
targeted for the center include advanced silicon processing 
and wafer fabrication techniques. A test area to support both 
production and research is located on-site. The building is 
also the home of the FIFO, ECL, and Subsystems product 
lines. 

IDT’s second largest facility is locatedin Salinas, California, 
about an hour south of Santa Clara. This 95,000-square-foot 
facility, located on 14 acres, houses the Static RAM Division 
and Specialty Memory product line. Constructed in 1985, this 
facility contains an ultra-modern 25,000-square-foot high- 
volume wafer fabrication area measured at Class 2-to-3 (a 
maximum of 2 to 3 particles-per-cubic-foot of 0.2 micron or 
larger) cleanroom conditions. Careful design and construction 
of this fabrication area created a clean room environment far 
beyond the 1985 average for U.S. fab areas. This made 
possible the production of large volumes of high-density 
submicron geometry, fast static RAMs. This facility also houses 
shipping areas for IDT’s leadership family of CMOS and 
BiCMOS static RAMs. This site can expand to accommodate 
a 250,000-square-foot complex. 

To extend our capabilities while maintaining strict control of 
our processes, IDT has an operational Assembly and Test 
facility located in Penang, Malaysia. This facility assembles 
product to U.S. standards, with all assemblies done under 
laminar flow conditions (Class 100) until the silicon is encased 
in its final packaging. All products in this facility are 
manufactured to the quality control requirements of MIL-STD- 
883. 

All of IDT’s facilities are aimed at increasing our 
manufacturing productivity to supply ever-larger volumes of 
high-performance, cost-effective, leadership CMOS products. 








SUPERIOR QUALITY AND RELIABILITY | 


- Maintaining the highest standards of quality in the industry 
on all products is the basis of Integrated Device Technology’s 
manufacturing systems and procedures. From inception, 
quality and reliability are built into all of IDT’s products. Quality 
is “designed in” at every stage of manufacturing — as opposed 
_ to being “tested-in” later — in order to ensure impeccable 

performance. . tan 

Dedicated commitment to fine workmanship, along with 
development of rigid controls throughout wafer fab, device 
assembly andelectrical test, create inherently reliable products. 
Incoming materials are subjected to careful inspections. Quality 
monitors, or inspections, are performed throughout the 
manufacturing flow. 

IDT military grade monolithic hermetic products are designed 
to meet or exceed the demanding Class B reliability levels of 
MIL-STD-883 and MIL-M-38510, as defined by Paragraph 
1.2.1 of MIL-STD-883. 

Product flow and test procedures for all monolithic hermetic 
military grade products are in accordance with the latest 
revision and notice of MIL-STD-883. State-of-the-art production 
techniques and computer-based test procedures are coupled 
with tight controls and inspections: to ensure that products 


meet the requirements for 100% screening. Routine quality 


conformance lot testing is performed as defined i in Blea 
883, Methods 5004 and 5005. 

* For IDT module products, screening of the fully assembled 
substrates is performed, in addition to the monolithic level 
screening, to assure package integrity and mechanical 


reliability. All modules receive 100% electrical tests (DC, 
functional and dynamic switching) to ensure compliance with 
the "subsystem" specifications. | 

By maintaining these high siandards and rigid eenirals 
throughout every step of the manufacturing process, IDT 
ensures that commercial, industrial and military grade products 
consistently meet customer pequire ments for quality, ley 
and performance. 


SPECIAL PROGRAMS 

Class S. IDT also has all manufacturing, screening and 
test capabilities in-house (except X-ray and some Group D 
tests) to perform complete Class S processing per MIL-STD- 
883 on all IDT products and has supplied Class S products on 
several programs. 

Radiation Hardened. IDT has developed and supplied 
several levels of radiation hardened products for military/ 
aerospace applications to perform at various levels of dose 
rate, total dose, single event upset (SEU), upset and latchup. 
IDT products maintain nearly their same high-performance 
levels built to these special process requirements. The 
company has in-house radiation testing capability used both 
in process development and testing of deliverable product. 
IDT also has a separate group within the company dedicated 
to supplying products for radiation hardened applications and 
to continue research and development of process and products 
to further improve radiation hardening capabilities. 
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QSP-—QUALITY, SERVICE AND PERFORMANCE 


Quality from the beginning, is the foundation for IDT’s 
commitment to supply consistently high-quality products to 
our customers. IDT’s quality commitment is embodied in its all 
pervasive Continuous Quality Improvement (CQI) process. 
Everyone who influences the quality of the product-from the 
designer to the shipping clerk—is committed to constantly 
improving the quality of their actions. 


IDT QUALITY PHILOSOPHY 

“To make quantitative constant improvement in the quality 
of our actions that result in the supply of leadership products 
in conformance to the requirements of our customers.” 


IDT's ASSURANCE STRATEGY FOR CQI 

Measurable standards are essential to the success of CQI. 
Allthe processes contributing to the final quality of the product 
need to be monitored, measured and improved upon through 
the use of statistical tools. 


DEVELOPMENT 


| 
FAB 


| 
PRODUCT FLOW ASSEMBLY 


| 
TEST 

| 
SHIP 


Our customers receive the benefit of our optimized systems. 
Installed to enhance quality and reliability, these systems 
provide accurate and timely reporting on the effectiveness of 
manufacturing controls and the reliability and quality 
performance of IDT products and services. 


ORDER ENTRY 


| 
PRODUCTION CONTROL 
SERVICE FLOW | 
SHIPPING 


| 
CUSTOMER SUPPORT 


Thesesystems and controls concentrate on CQI|by focusing 
on the following key elements: 


Statistical Techniques 
Using statistical techniques, including Statistical Process 
Control (SPC) to determine whether the product/ 
processes are under controi. 


Standardization 
Implementing policies, procedures and measurement 
techniques that are common across different operational 
areas. 


Documentation 
Documenting and training in policies, procedures, 
measurement techniques and updating through 
characterization/ capability studies. 


Productivity Improvement 
Using constant improvement teams made up from 
employees at all levels of the organization. 


Leadership 
Focusing on quality as a key business parameter and 
strategic strength. 


Total Employee Participation 
Incorporating the CQ! process into the IDT Corporate 
Culture. 


Customer Service 
Supporting the customer, as a partner, through 
performance review and pro-active problem solving. 


People Excellence 
Committing to growing, motivating and retaining people 
through training, goal setting, performance measurement 
and review. 


PRODUCT FLOW 


Product quality starts here. IDT has mechanisms and 
procedures in place that monitor and control the quality of our 
development activities. From the calibration of design capture 
libraries through process technology and product 
characterization that establish whether the performance, 
ratings and reliability criteria have been met. This includes 
failure analysis of parts that will improve the prototype product. 

At the pre-production stage once again in-house qualification 
tests assure the quality and reliability of the product. All 
specifications and manufacturing flows are established and 
personnel trained before the product is placed into production. 


Manufacturing 

To accomplish CQI during the manufacturing stage, control 
items are determined for major manufacturing conditions. 
Data is gathered and statistical techniques are used to control 
specific manufacturing processes that affect the quality of the 
product. 
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In-process and final inspections are fed back to earlier 
processes to improve product quality. All product is burned- 
in (where applicable) before 100% inspection of a 
characteristics takes place. 

Products which pass final inspection are then sinject. to 
Quality Assurance and Reliability Tests. This data is used to 
improve manufacturing processes and provide reliability 
predictions of field applications. 


Inventory and Shipping | 

Controls in shipping focus on ensuring parts are identified 
and packaged correctly. Care is also taken to see that the 
correct paperwork is present and the product being shipped 
was processed COmmeelly 


SERVICE FLOW 

Quality not only applies to the product but to the quality-of- 
service we give our customers. Service is also constantly 
monitored for improvement. 


Order Procedures 

_Checks are made at the order entry stage to ensure the 
correct processing of the Customer's product. After verification 
and data entry the Acknowledgements (sent to Customers) 
are again checked to ensure details are correct. As part of the 
CQI process, the results of these verifications are analyzed 
using statistical techniques and corrective actions are taken. 


Production Control 
Production Control (P.C.) is responsible for the flow and 

logistics of material as it moves through the manufacturing 
processes. The quality of the actions taken by P.C. greatly 
influences the quality of service the customer receives. 
Because many of our customers have implemented Just-in- 
Time (JIT) manufacturing practices, IDT as a supplier has 
adopted these same disciplines. As a result, employees 
receive extensive training and the performance level of key 
actions are kept under constant review. These key actions 
include: 

Quotation response and accuracy. 

Scheduling response and accuracy. 

Response and accuracy of Expedites. 

Inventory, management, and effectiveness. 

On-time delivery. 


- Customer Sappone 
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IDT has a worldwide network of aig offices and Technical 
Development Centers. These provide local customer support 
on business transactions, and in addition, support customers 
on applications information, technical services, benchmarking 
of hardware solutions, and> demonstration of various 
Development Workstations. 

The key to CQI is the timely rasciuiien of defects and 
implementation of the corrective actions. This is no more 
important than when product failures are found by a customer. 
When failures are found at the customer’s incoming inspection, 
in the production line, or the field application, the Division 
Quality Assurance group is the focal point for the investigation 
of the cause of failure and implementation of the corrective 
action. IDT constantly improves the level of support we give 
our customers by monitoring the response time to customers 
who have detected a product failure. Providing the customer 
with an analysis of the failure, including corrective actions and 
the statistical analysis of defects, brings CQI full circle—full 
support of our customers and their designs with high-quality 
products. | 


SUMMARY 

In 1990, IDT made the commitment to “Leadership through 
Quality, Service, and Performance Products”. 

We believe by following this credo IDT and our customers 
will be successful in the coming decade. With the 
implementation of the CQI strategy within the company, we 
will satisfy our goal... 

“Leadership through Quality, Service and Performance 
Products”. 





IDT QUALITY CONFORMANCE PROGRAM 


A COMMITMENT TO QUALITY 


Integrated Device Technology’s monolithic assembly 
products are designed, manufactured and tested in accordance 
with the strict controls and procedures required by Military 
Standards. The documentation, design and manufacturing 
criteria of the Quality and Reliability Assurance Program were 
developed and are being maintained to the most current 
revisions of MIL-38510 as defined by paragraph 1.2.1 of MIL- 
STD-883 and MIL-STD-883 requirements. 

Product flow and test procedures for all Class B monolithic 
hermetic Military Grade microcircuits are in full compliance 
with paragraph 1.2.1 of MIL-STD-883. State-of-the-art 
production techniques and computer-based test procedures 
are coupled with stringent controls and inspections to ensure 
that products meet the requirements for 100% screening and 
quality conformance tests as defined in MIL-STD-883, Methods 
5004 and 5005. 

Product flow and test procedures for all plastic and 
commercial hermeticproducts are in accordance with industry 
practices for producing highly reliable microcircuits to ensure 
that products meet the IDT requirements for 100% screening 
and quality conformance tests. 

By maintaining these high standards and rigid controls 
throughout every step of the manufacturing process, |DT 
ensures that our products consistently meet customer 
requirements for quality, reliability and performance. 


SUMMARY 


Monolithic Hermetic Package Processing Flow") 

Refer to the Monolithic Hermetic Package Processing Flow 
diagram. All test methods refer to MIL-STD-883 unless 
otherwise stated. 

1. Wafer Fabrication: Humidity, temperature and 
particulate contamination levels are controlled and 
maintained according to criteria patterned after Federal 
Standard 209, Clean Room and Workstation 
Requirements. Allcriticalworkstations are maintained 
at Class 100 levels or better. 


Wafers from each wafer fabrication area are subjected 
to Scanning Electron Microscope analysis on aperiodic 
basis. 


2. Die Visual Inspection: Wafers are cut and 
separated and the individual die are 100% visually 
inspected to strict IDT-defined internal criteria. 


3. Die Shear Monitor: To ensure die attach integrity, 


product samples are routinely subjected to a shear 
strength test per Method 2019. 


NOTE: 


4. 


10. 


11. 


12. 


Wire Bond Monitor: Product samples are routinely 
subjected to a strength test per Method 201 1, Condition 
D, to ensure the integrity of the lead bond process. 


Pre-Cap Visual: Before the completed package is 
sealed, 100% of the product is visually inspected to 
Method 2010, Condition B criteria. 


Environmental Conditioning: 100% of the sealed 
product is subjected to environmental stress tests. 
These thermal and mechanical tests are designed to 
eliminate units with marginal seal, die attach or lead 
bond integrity. 


Hermetic Testing: 100% of the hermetic packages 
are subjected to fine and gross leak seal tests to 
eliminate marginally sealed units or units whose 
seals may have become defective as a result of 
environmental conditioning tests. 


Pre-Burn-in Electrical Test: Each product is 100% 
electrically tested atan ambient temperature of +25°C 
to IDT data sheet or the customer specification. 


Burn-in: 100% of the Military Grade product is 
burned-in under dynamic electrical conditions to the 
time and temperature requirements of Method 1015, 
Condition D. Except for the time, Commercial Grade 
product is burned-in as applicable to the same 
conditions as Military Grade devices. 


Post-Burn-In Electrical: After burn-in, 100% of the 
Class B Military Grade product is electrically tested to 
IDT data sheet or customer specifications over the — 
55°C to +125°C temperature range. Commercial 
Grade products are sample tested to the applicable 
temperature extremes. 


Mark: All product is marked with product type and lot 
code identifiers. MIL-STD-883 compliant Military 
Grade products are identified with the required 
compliant code letter. 


Quality Conformance Tests: Samples of the 
Military Grade product which have been processed to 
the 100% screening tests of Method 5004 are routinely 
subjected to the quality conformance requirements of 
Method 5005. 


1. For quality requirements beyond Class B levels such as SEM analysis, X-Ray inspection, Particle Impact Noise Reduction (PIND) test, Class S screening 
or other customer specified screening flows, please contact your integrated Device Technology sales representative. 
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SUMMARY 


Monolithic Plastic Package Processing Flow 
Refer to the Monolithic Plastic Package ‘Processing Flow 
diagram. All test methods refer to MIL-STD-883 unless 


otherwise stated. 
4. 


Wafer Fabrication: Humidity, temperature and 


particulate. contamination levels are controlled and 


maintained according to criteria patterned after Federal 
Standard 209, Clean Room and Workstation 
Requirements. Allcritical workstations are maintained 


7 at Class 100 levels or better. 


Topside silicon nitride passivation is all applied to all 
wafers for better moisture barrier characteristics. 


_Wafers from each wafer fabrication area are subjected 


to Scanning Electron Microscope analysis on a periodic 
basis. | | 


Die Visual Inspection: Wafers are 100% visually 
inspected to strict IDT defined internal criteria. 


Die Push Test: To ensure die attach integrity, product 


samples are routinely subjected to die push tests, 
patterned after MIL-STD-883, Method 2019. 


Wire Bond Monitor: Product samples are routinely 


subjected to wire bond pull and ball shear tests to 


ensure the integrity of the wire bondprocess, patterned 
after MIL-STD-883, Method 2011, Condition D. 


Pre-Cap Visual: Before encapsulation, all product 
lots are visually inspected (using LTPD 5 sampling 


___ Plan) to criteria patterned after MIL-STD- 883, Method 
© 2010, Condition B. 


3.2. 


Post Mold Cure: Plastic encapsulated devices are — 


_.. baked to ensure an optimum polymerization of the 


10. 


11. 


code identifiers. 


epoxy mold compound so as to ennance: moisture . | 
resistance characteristics. | 


Bis-Bure ih: Electrical: Each «ac is 100% : 
electrically tested atan ambient temperature of +25°C 
to IDT data sheet or the customer specification. 


Burn-in: Except for MSI Logic family devices where 


it may be obtained as an option, all Commercial _ 
Grade plastic package products are burned-in for 16. 
hours at +125°C. minimum (or equivalent), utilizing. 
the same burn-in conditions. as the Military Grade © 
product. | 


Post-Burn-in Electrical: After burn-in, 100% of the — 


_ plastic product is electrically tested to IDT data sheet 


or customer specifications at the maximum 
temperature extreme. The minimum temperature 


extreme is tested periodically on an audit basis. 


Mark: All product is marked with product type and lot 
Products are identified with the ; 
assembly and test locations. 


Quality Conformance Inspection: Samples of the 
plastic product which have been processed to the 
100% screening requirements are subjected to the - 
Periodic Quality Conformance Inspection Program. 
Where indicated, the test methods are patterned after 
MIL-STD- si criteria. 





TABLE 1 - 


This table defines the device class screening procedures for IDT's high reliability products in conformance with MIL-STD-883C. 


Monolithic Hermetic Package Final Processing Flow 


a CLASS. cass 
OPERATION | TESTMETHOD  |RQMT| TESTMETHOD | RQMT| TESTMETHOD |ROMT| 


BURN-IN | 1015 Cond. D, 100% | 1015 Cond. D, 100% | Per applicable 100% 
240 Hrs @ 125°C or 160 Hrs. @ 125°C min device specification |. 
equivalent or equivalent 

POST BURN-IN 100% 100% 

ELECTRICAL: | 


Static (DC), Functional 
100% 
device specification device specification: 


and Switching (AC) 
100% 
425°C +25°C 
-IDT Spec 100% | IDT Spec 100% 
| 5005 Group B,C,D. IDT Spec 
IDT Spec | 100% | IDT Spec 100% | IDT Spec 
(Visual/Plant Clearance) : | . 
NOTES: . 


1. Class-C = IDT commercial spec. for hermetic and plastic packages 
2. Typical O°C, 70°C, Extended -55°C +125°C 












































Per applicable (). 
device specification 


Per applicable 
device specification 
+25, -55 and 125°C 


Per applicable 
device specification 
+25, -55 and 125°C 








































Per applicable () 
device specification 


Per applicable 
device specification 
and 5005 


‘Per applicable 
device specification 
and 5005 








Group A ELECTRICAL: 
Static (DC), Functional 
and Switching (AC) 
















MARK/LEAD 
STRAIGHTENING 


IDT Spec IDT Spec 






IDT Spec 







Per applicable . 
device specification |- 
+#25°C = 














FINAL ELECTRICAL 
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| QUALITY CONFORMANCE 


5005 Group B, C, D. 
INSPECTION = 







QUALITY SHIPPING 
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RADIATION TOLERANT/ENHANCED/HARDENED PRODUCTS FOR : 


RADIATION ENVIRONMENTS ~ 


INTRODUCTION 
_ The need for high-performance CMOS integrated circuits 
in military and space systems is more critical today than ever 
before. The low power ‘dissipation that is achieved using 
CMOS technology, along with the high complexity and density 
levels, makes CMOS the nearly ideal component for all types 
of applications. , 
Systems designed for military or space applications are 
intended for environments where high levels of radiation may 
be encountered. The implication of a device failure within a 
_ military or space system clearly is critical. IDT has made a 
significant contribution toward providing reliable radiation- 
tolerant systems by offering integrated circuits with enhanced 
radiation tolerance. Radiation environments, IDT process 
enhancements and device tolerance levels achieved are 
described below... | - 3 


THE RADIATION ENVIRONMENT 


There are four different types of radiation environments 
that are of concern to builders of military and space systems. 


These environments and their effects on the device operation, . 


summarized in Figure 1, are as follows: | 
Total Dose Accumulation refers to the total amount of 
accumulated gamma rays experienced by the devices in the 


system, and is measured in RADS (Sl) for radiation units .. 


experienced at the silicon level. The physical effect of gamma 
rays on semiconductor devices is to cause threshold shifts (Vt 
shifts) of both the active transistors as well as the parasitic field 
transistors. Threshold voltages decrease as total dose is 
accumulated; at some point, the device will begin to exhibit 
parametric failures as the input/output and supply currents 
increase. At higher radiation accumulation levels, functional 
failures occur. Inmemory circuits, however, functional failures 
due to memory cell failure often occur first. 


Burst Radiation or Dose Rate refers to the amount of 


radiation, usually photons or electrons, experienced by the 
devices in the system due to a pulse event, and is measured 
in RADS (Si) per second. The effect of a high dose rate or 
burst of radiation on CMOS integrated circuits is to cause 
temporary upset of logic states and/or CMOS latch-up. Latch- 
up can cause permanent damage to the device. 

Single Event Upset (SEU) is a transient logic state change 
caused by high-energy ions, such as energetic cosmic rays, 
striking the integrated circuits. As the ion passes through the 
silicon, charge is either created through ionization or direct 
nuclear collision. If collected by a circuit node, this excess 
charge can cause a change in logic state of the circuit. 
Dynamic nodes that are not actively held at a particular logic 
state (dynamic RAM cells for example) are the most susceptible. 
These upsets are transient, but can cause system failures 
known as “soft errors.” 

Neutron Irradiation will cause structural damage to the 
silicon lattice which may lead to device leakage and, ultimately, 
functional failure. 


Neutron Neutrons Nuclear 
Event 














Radiation | | . Primary 
Category | Particle 
, a Permanent 


TotalDose | Gamma Space or 
Nuclear 
Event 












Dose Rate © Photons | Nuclear Temporary 
| - Event Upset of Logic 
State or 
3 _| Latch-up 





SEU Cosmic | Space Temporary 
Rays -— | Upset of 
Logic State | 


Device Leakage 
Due to Silicon 
Lattice Damage 
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Figure 1. 


DEVICE ENHANCEMENTS 


Of the four radiation environments above, IDT has taken 
considerable data on the first two, Total Dose Accumulation 
and Dose Rate. IDT has developed aprocess that significantly 
improves the radiation tolerance of its devices within these 
environments. Prevention of SEU failures is usually 
accomplished by system-level considerations, such as Error 
Detection and Correction (EDC) circuitry, since the occurrence 
of SEUs is not particularly dependent on process technology. 
Through IDT’s customer contracts, SEU has been gathered 


on some devices. Little is yet known about the effects of 


neutron-induced damage. For more information on SEU 
testing, contact IDT’s Radiation Hardened Product Group. | 

Enhancements to IDT’s standard process are used to 
create radiation enhanced and tolerant processes. Field and 
gate oxides are “hardened” to make the device less susceptible 
to radiation damage by modifying the process architecture to 
allow lower temperature processing. Device implants and Vts 
adjustments allow more Vt margin: .In addition to process 
changes, IDT’s radiation enhanced process utilizes epitaxial 
substrate material. The use of epi substrate material provides 
a lower substrate resistance environment to create latch-up 
free CMOS structures. 


RADIATION HARDNESS CATEGORIES 
Radiation Enhanced (RE) or Radiation Tolerant (‘RT) 
versions of IDT products follow IDT’s military product data 
sheets whenever possible (consult factory). IDT’s Total Dose 
Test plan exposes a sample of die on a wafer to a particular 
Total Dose level via ARACOR X-Ray radiation. This Total 
Dose Test plan qualifies each ‘RE or ‘RT wafer toa Total Dose 
level. Only wafers with sampled die that pass Total Dose level 
tests are assembled and used for orders (consult factory for 
more details on Total Dose sample testing). With regard to 
Total Dose testing, clarifications/exceptions to MIL-STD-883, 
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Methods 5005 and 1019 are required. Consult factory for 

more details. 

The ‘RE and ‘RT process enhancements enable IDT to 
offer integrated circuits with varying grades of radiation 
tolerance or radiation “hardness”. 

* Radiation Enhanced process uses Epi wafers and is able 
to provide devices that can be Total Dose qualified to 10K 
RADs (Si) or greater by IDT’s ARACOR X-Ray Total Dose 
sample die test plan (Total Dose levels require negotiation, 
consult factory for more details). 

¢ Radiation Tolerant product uses standard wafer/process 
material that is qualified to 10K RADs (Si) Total Dose by 
IDT’s ARACOR X-Ray Total Dose sample die test plan. 
Integrated Device Technology can provide Radiation 

Tolerant/Enhanced versions of all product types (some speed 

grades may not be available as ‘RE). 

Please contact your IDT sales representative or factory 
marketing to determine availability and price of any IDT 
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product processed in accordance with one of these levels of 
radiation hardness. 


CONCLUSION 


There has been widespread interest within the military and 
space community in IDT’s CMOS product line for its radiation 
hardness levels, as well as its high-performance and low 
power dissipation. To serve this growing need for CMOS 
circuits that must operate in a radiation environment, IDT has 
created a separate group within the company to concentrate 
on supplying products for these applications.Continuing 
research and development of process and products, including 
the use of in-house radiation testing capability, will allow 
Integrated Device Technology to offer continuously increasing 
levels of radiation-tolerant solutions. 
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THERMAL PERFORMANCE CALCULATIONS FOR IDT’S PACKAGES 


Since most of the electrical energy consumed by 
microelectronic devices eventually appears as heat, poor 
thermal performance of the device or lack of management of 
this thermal energy can cause a variety of deleterious effects. 
This device temperature increase can exhibit itself as one of 
the key variables in establishing device performance and long 
term reliability; on the other hand, effective dissipation of 
internally generated thermal energy can, if properly managed, 
reduce the deleterious effects and improve component 
reliability. 

A few key benefits of IDT’s enhanced CMOS process are: 
low power dissipation, high speed, increased levels of 
integration, wider operating temperature ranges and lower 
quiescent power dissipation. Because the reliability of an 
integrated circuit is largely dependent on the maximum 
temperature the device attains during operation, and as the 
junction stability declines with increases in junction temperature 
(Ty), it becomes increasingly important to maintain a low (Ty). 

CMOS devices stabilize more quickly and at greatly lower 
temperature than bipolar devices under normal operation. 
The accelerated aging of an integrated circuit canbe expressed 
as an exponential function of the junction temperature as: 


ta=toexp | Ea f_1-_1 
k To TJ 


where 
tA = lifetime at elevated junction (TJ) temperature 
to = normal lifetime at normal junction (TO) temperature 
Ea = activation energy (ev) 
k = Boltzmann's constant (8.617 x 10ev/k) 


i.e. the lifetime of a device could be decreased by a factor of 
2 for every 10°C increase temperature. 


To minimize the deleterious effects associated with this 

potential increase, IDT has: 

1. Optimized our proprietary low-power CMOS fabrication 
process to ensure the active junction temperature rise 
is minimal. 

2. Selected only packaging materials that optimize heat 
dissipation, which encourages a cooler running device. 

3. Physically designed all package components to 
enhance the inherent material properties and to take 
full advantage of heat transfer and radiation due to 
case geometries. 


Ref. MIL-STD-883C, Method 1012.1 
JEDEC ENG. Bulletin No. 20, January 1975 
1986 Semi. Std., Vol. 4, Test Methods G30-86, G32-86. 


4. Tightly controlled the assembly procedures to meet or 
exceed the stringent criteria of MIL-STD-883 to ensure 
maximum heat transfer between die and packaging 
materials. 

The following figures graphically illustrate the thermal values 
of IDT’s current package families. Each envelope (shaded 
area) depicts a typical spread of values due to the influence of 
a number of factors which include: circuit size, package 
materials and package geometry. The following range of 
values are to be used as a comprehensive characterization of 
the major variables rather than single point of reference. 

When calculating junction temperature (TJ), it is necessary 
to know the thermal resistance of the package (6JA) as 
measured in “degree celsius per watt”. With the accompanying 
data, the following equation can be used to establish thermal 
performance, enhance device reliability and ultimately provide 
you, the user, with a continuing series of high-speed, low- 
power CMOS solutions to your system design needs. 


QJA = [TJ — TA}/P | 
Td = TA + P[QJA] = TA + P[6JNC + OCA] 


where 

JIC = TJ—TC OCA = IC—TA 
P P 

68 = Thermal resistance 

J = Junction 

P = Operational power of device (dissipated) 

TA = Ambient temperature in degree celsius 

TJ = Temperature of the junction 

Tc = Temperature of case/package 

@cA = Case to Ambient, thermal resistance—usually a 
measure of the heat dissipation due to natural or 
forced convection, radiation and mounting 
techniques. 

@uc = Junction to Case, thermal resistance—usually 
measured with reference to the temperature at a 
specific point on the package (case) surface. 
(Dependent on the package material properties 
and package geometry.) 

@JA = Junction to Ambient, thermal resistance—usually 


measured with respect to the temperature of a 
specified volume of still air. (Dependent on O/C + 
QJA which includes the influence of area and 
environmental condition.) 
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PACKAGE DIAGRAM OUTLINE INDEX 
SECTION PAGE 


MONOLITHIC PACKAGE DIAGRAM OUTLINES .....0... oe. cc ccseeseeseseessesssensneseeneesnenneees 4.3 
PKG. DESCRIPTION 
G84-2 84-Ledd:- Pin Gra Array (Cavity: GOWN) siccecsesescewsssiitessevessuchosduconvesvanevercoseaausansanhoxcnsaseatiuouioarsaves 8 
G84-4 84-Lead Pin Grid Array (cavity COWN—R3010A) ..............cccccceecceceececeeceeeeeeeeceeeeseeseeeeeeeeeseseeaes 9 
G144-1 144-Lead Pin Grip Array (Cavity GOWN) .............cccccccecssseececesssseseeeeceseseeeeescseeesceseaaeeateateneeeesenas 11 
G144-2 144-Lead Pin Grip Array (Cavity UD — R8001) .... eee ceccsceseceeeecesessseeeeeeessseseaenecssssseaeeees 10 
G144-3 144-Lead Pin Grip Array (cavity down — R3Q000A) ...........ccceccceceeeeeeeseneeeeeeeessneaneneenteaneneneeneess 12 
G161-1 161-Lead Pin:Grid Array (Cavity GOWN) si ccsiscieleesancces sactehicesadecelansseacdassuancteatdouseucrenscecdavvauenneas 13 
G175-1 175-Lead Pin Grid Array (cavity COWN—R3000A) ............ccececcceceeceeeseneeeecerseeeseseneneaueseeeeerees 14 
G179-1 179-Lead Pin Grid Array (Cavity GOWN) ................cccsssessensersssceecneaeeceeceseecescesesssseeasanaesereeseeeeeees 15 
G447-1 BAT load: Pit GHG AAV aiaduciaastutet Sooteniaseventnnbiteinccevdbvapadeeauadevaeetinaavensienuded eaten eieahieeateetias 16 
J20-1 20-Pin Plastic Leaded Chip Carrier (SQUare) ...........ceceeeeeceeceeeeeeeceeeeneeeeeneeeeeeeecesseetetseeeneneeneeees 22 
J28-1 28-Pin Plastic Leaded Chip Carrier (SQUALE) .............ceeeseccecceceeeeeeeeeeceeceeceseeeeceecesteteerereeeeeeeees 22 
J44-4 44-Pin Plastic Leaded Chip Carrier (SQUare) .............ceeceeeeeceeeeeeeseeeeeeeeeeeeeeeeeeseceeeeneeteneeeanaess 22 
J52-1 52-Pin Plastic Leaded Chip Carrier (SQUarE) ...........::cccceecceceseseeseerececeecesencneeeeesseeesaeeesteseeeseees 22 
J68-1 68-Pin Plastic Leaded Chip Carrier (SQUArE) ..............::ccccccccesessesentneceecescetenaeeseeceseneeeeesseseesees 22 
J84-1 84-Pin Plastic Leaded Chip Carrier (SQUare) ...........:ccceceecescssseneceeeeeeeeeeeceeereceesesananerseeenanaaaenes 22 
L20-2 20-Pin Leadless Chip Carrier (Square) ...........:::ccccsseceeeesereeceeeeteees seadutusapiehiavenanacneaes seadauh 6 
L28-1 26-Pin:-Leadless Chip: Carrier(SQUare) vascsccccs: tap avanscdsc actus oracttavaadapuastudcapecadeadecvabaeuctdea wesmsctacss 6 
L44-1 44-Pin Leadless Chip Carrier (Square) ..............0:c00000 saitacnsUbatae se iasehaa tit that haan ee lesals 6 
L48-1 43-Pin: Leadless ‘Chip: Carmel | SQUALG) ccs secs 222 cc cetewavescslecsuwts de vata ectsndedvaeccnsttnaueh cdaeuaeavasianens 6 
L52-1 52-Pin eadiess: Chip Carrer: (SQUaAnE) sessecesiusl evakserencvediewereusecaccaransva ius aodwateagleatecaauanscedee aes 7 
L52-2 52-Pin Headless Chip: Garner (SGU ale) sictac22, 20 ocancoeensasetean. nada cagacuaatanteasetehdennsuana anieekemnae aimee 7 
L68-1 68-Pin Leadless Chip-Carner (SQUAPE) o.citecceceetsecstivedinncceaaanta rena deentlameness 7 
L68-2 68:-Pin:Leadiess Chip Garner (Square) ss sstuAcaseiseesdenstet Meedes Wastes Gneie deta eheiouecs 7 
SO20-1 20-Pin Small Outline IC (J Bend — 300 mil) .......... sobseaiind Gauci ach dasia toca ams etedavatsdevegmeareaaeaenies 17 
S024-4 20-Pin Small Outline IC (J Bend — 300 mil) ........ eee eee cecececeeceseeeceeeeeaeeeecesaceceeeneesessseaanenes 17 
SO024-8 20-Pin Small Outline IC (J Bend — 300 mil) 00... ecceccccceeceessesseneceeeeeceseeneeeeceesseseeeeeeserecees 17 
$028-5 20-Pin Small Outline IC (J Bend — 300 mil) «0... pibieaeiesere cece ce ates Cert ae 17 
SO028-6 28-Pin Small Outline IC (J: Bend: — 400 Mil) cis ssoscvssned cavivedetasvsacsthadeawonieieiondutecesculaeqatedioesaxess 18 
S032-2 32-Pin Small Outline IC (J Bend — 300 Mil) ....... eee ceceeecceeecseseeeeeesaeececssseeceseceesesesnsesanes 17 
SO32-2 32-Pin Small Outline IC (J Bend — 400 mil) siicc.eccsccsscsescescevcesconesescdsavedeasantercccessessvavsdecesoreues 18 
F84-1 84-Lead Quad Flatpack (Cavity GOWN) ............. cc cccsesssssesseecceccecedeccceecuceuseucuseeetsseeceacuseeeeuseunauea 3 
F84-2 84-Lead Quad Flatpack (CAaVIty UD). visi. diviccvascueacesssvednescsvveanisvadnsenanducsadesbebcesandavavsettacwdsnageeias. 2 
F172-1 172-Lead Quad Flatpack (Cavity UD—R3001) 0... cceeesseececesesstecessseeeeesestetecttsesenerstens 
F172-2 172-Lead Quad Flatpack (cavity CoWN—R3000A) .......... ec ccecssseceeeessseeeecnueecesesessessaneaeeeees . 4 
M84-1 84-Lead MQUAD™ (J-bend; Cavity: GOWN): scescsssbcsesuinctivacetadacvinviveustivas soettnalsleveialiccduleedaaetierta: 20 
M160-1 160-Lead MQUAD™ (cavity GOWN) .........cc:cceesceceeeseeesenteeereees Sas uu daca Soseaeunieg esteted eae avatageteate 21 
CQ84-1 B4-eaG CEAOUAD.: airsiiensicecveunet Grenccccagsetereau tate sue opacotcdlouas elauicedunddup tea eanuenatcwideeci er uneiuadvact ee 5 
PQ80-2 80-Lead Rectangular Plastic Quad Flatpack (EIAU) ..... eee eeeceeeeeeeneeeeeesaeeees i cieeetetnees 19 
100-Lead Rectangular Plastic Quad Flatpack (EIAU) .......... cc eceeseeceesesssteeceeseseesseeeeeeeensereaeas 19 
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Module package diagrams are located at the back of each Subsystems data sheet. 
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FLATPACKS (Continued) 
84 LEAD QUAD FLATPACK (CAVITY UP) 
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2. BSC — BASIC LEAD SPACING BETWEEN CENTERS. 
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FLATPACKS (Continued) 
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FLATPACKS (Continued) 
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CERQUADS 
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2, BSC — BASIC LEAD SPACING BETWEEN CENTERS. 
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ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. | 
BSC — BASIC LEAD SPACING BETWEEN CENTERS. 
SYMBOL "M” REPRESENTS THE PGA MATRIX SIZE. 
SYMBOL "N” REPRESENTS THE NUMBER OF PINS 
CHAMFERED CORNERS ARE IDT’S OPTION. 
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PIN GRID ARRAYS (Continued) 
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TALL DIMENSIONS ARE IN NCHES, UNLESS OTHERWSE, 


SPECIFIED. — 7 


BSC — BASIC LEAD SPACING BETWEEN CENTERS. 


SYMBOL “M” REPRESENTS THE PGA MATRIX SIZE. 


SYMBOL "N” REPRESENTS THE NUMBER OF PINS 


CHAMFERED CORNERS ARE IDT’S OPTION. 7 | 
CROSS HATCHED AREA INDICATES INTEGRAL METALLIC . 
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740 6. EXTRA PIN (D—4) ELECTRICALLY CONNECTED TO D—3. 
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PIN GRID ARRAYS (Continued) 
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5. CHAMFERED CORNERS ARE IDT’S OPTION. 

6. CROSS HATCHED AREA INDICATES INTEGRAL METALLIC 

HEAT SINK. 
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PIN GRID ARRAYS (Continued) 
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NOTES: 
1. ALL DIMENSIONS ARE IN INCHES, 
UNLESS OTHERWISE SPECIFIED. 
2. BSC — BASIC LEAD SPACING BETWEEN 
CENTERS. 
cE, £3. D1 & El DO NOT INCLUDE MOLD FLASH 
OR PROTRUSION AND TO BE MEASURED 
FROM THE BOTTOM OF THE PKG. 
4, FORMED LEADS SHALL BE PLANAR WITH 
RESPECT TO ONE ANOTHER WITHIN .004” 
AT THE SEATING PLANE 
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20-32 LEAD SMALL OUTLINE (J-BEND, 300 MIL) 








DWG # $020-1 S024—4 S024—8 S028—5 S032—2 
# OF LDS (N) 
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AT 078 | 095 | 082 | 095 | .078 | .091 | .078 | .095 | .082 | .095 
PBT = = | 026 | 052 | = | | = | = 026 | 032 | 
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po O10 | 020 | 010 | .020 | .010 
pS 025 | 055 | 052 
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NOTES: 
1. ALL DIMENSIONS ARE IN INCHES, UNLESS OTHERWISE 
SPECIFIED. 


2. BSC — BASIC LEAD SPACING BETWEEN CENTERS. 


3. FORMED LEADS SHALL BE PLANAR WITH RESPECT TO 
ONE ANOTHER WITHIN .004” AT THE SEATING PLANE. 


4. D1 & E1 SHOULD BE MEASURED FROM THE BOTTOM 
OF THE PACKAGE. 


S. ND & NE REPRESENT NUMBER OF LEADS IN D & E 
DIRECTIONS RESPECTIVELY. 
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BSC — BASIC LEAD SPACING BETWEEN CENTERS 
3. D & E DO NOT INCLUDE MOLD FLASH OR PROTRUSIONS. 
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IDT RISC PROCESSING COMPONENTS 


THE COMPLETE RISC SOLUTION 


Integrated Device Technology, Inc. is dedicated to provid- 
ing complete RISC design solutions by combining expertise in 
silicon processes with leadership products in development 
systems and software. Long an industry leader in the fastest 
static RAMs and high-speed logic, IDT now offers RISC 
system building blocks comprised of components and board- 
level subsystems. 

As asemiconductor partner with MIPS Computer Systems, 
IDT has established a leadership position in the RISC mar- 
ketplace by supplying the fastest CPUs at 40MHz, pioneering 
RISC CPU Subsystem™ modules, and offering cost-effective 
development tools and software. 

The MIPS architecture has become an industry standard 
and has been adopted by over 100 leading OEM manufactur- 
ers including DEC, Sony, Tandem, NEC, CDC, Adobe, Si- 
emens, Nixdorf, Honeywell Bull and Silicon Graphics. The 
MIPS ISA (Instruction Set Architecture) has been selected by 
JIAWG as the 32-bit microprocessor standard for military 
avionics. 


RISComponent™ FAMILY OVERVIEW 

The R3000 Family consists of the R3000 RISC CPU, the 
R3001, R3041, R3051/52, and R3081 RiSControllers™, the 
R3010A Floating-Point Accelerator, the R3500 RISCore™ 
and the R3020 Write Buffer. The R3000 processor is a 


CPO 
(System Control Coprocessor) 


CONTROL 


derivative of the R2000A, the first commercially-available 
RISC processor introduced in 1985. The R3001 RISController 
and the R3051 family, including the R3041 and R3081, are 
versions of the processor tailored for embedded control and 
low-cost workstations. The R3500 integrates floating-point 
capability onto the R3000 pinout. The R4000 is the third 
generation of the MIPS RISC architecture that sets a new 
performance standard for the 1990s. 


THE IDT79R3000 CPU 


The R3000 processor consists of two tightly-coupled pro- 
cessors implemented on a single chip. 

The first processor is a full 32-bit Harvard Architecture CPU 
consisting of 32 registers, an integer ALU, a single-cycle 
shifter, and a multiplier/divider. The second processor is a 
system contro! coprocessor containing a Translation Look- 
aside Buffer (TLB) and control registers to support a virtual 
memory space of 4GB and separate instruction and Data 
caches. 

The R3000 CPU features: 
¢ Full 32-bit operation 
¢ Three instruction formats 
* Efficient 5-stage pipeline 
¢ On-chip cache control 
* On-chip Memory Management Unit 
¢ Multiprocessor capability 
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Figure 1. IDT79R3000 Processor 








THE IDT79R3010A FLOATING-POINT 
ACCELERATOR 


The R3010A Floating-Point Accelerator (FPA) supports full 
conformance with the IEEE 754 floating-point specification. it 
acts as a coprocessor to the R3000 CPU, providing a seam- 
less integration of fixed and floating-point instructions. All 
floating-point operations are apap en to the programmer. 

The R3010A FPA features: 
¢ Full 64-bit operation | 
¢ Single-cycle load/store instructions 
¢ Seamless interface to the R3000 or R3001 CPU 
- Three operation units (add/subtract, multiply and divide) 

can operate in parallel 
¢ Six-level pipeline 


THE IDT79R3001 RISController™ 


The R3001 RiSController optimizes the high-performance 
MIPS architecture for embedded control systems. Capable of 
28 MIPS performance at 33MHz, the R3001 incorporates new 
features for real-time control. The controller extends the 
performance range of the current R3000 processor, saves 
valuable real estate for space- ues designs and lowers 
system memory costs. 

The MIPS performance range is evens by the increase 
in the R38000 in the synchronous memory space from 512KB, 
maximum, to a full 32MB. This allows the system to perform 
with a guaranteed "cache" hit rate of 100%. The on-chip 
memory controller allows the designer to use standard SRAMs, 
DRAMs, or even VRAMs, representing a significant cost 
savings over other solutions. The processor supports pre- 
dictable interrupt response times for real-time control appli- 
cations, and system chip count is lowered by substantially 


reducing the number of devices needed to implement local | 


memory. 


THE R3051 FAMILY OF RiSControllers™ 


The IDT79R3051 Family is a derivative of the R3000, 
featuring a high level of integration and targeted to high- 
performance but cost-sensitive embedded processing appli- 
cations. The R3051 family is designed to bring the high- 
performance inherent in the MIPS RISC architecture into low- 
cost, simplified, power-sensitive applications. 

Functional units were integrated onto the CPU core in order 
to reduce the total system cost rather than to increase the 
inherent performance of the integer engine. Thus, the R3051 
family is able to offer 35 MIPS of integer performance at 

40MHz without requiring external SRAM or caches. 

The R3041 extends the range of price/performance achiev- 


_ ofusing the MIPS architecture. The R3041 has been designed 


to achieve minimal system and components cost, yet maintain 
the high performance inherent in the MIPS architecture. The 
R3041 also maintains pin and software compatibility with the 
R3051 and R3081. 

The R3081 extends the capabilities of the R3051 family by 
integrating the additional resources into the same pinout. The 
R3081 thus extends the range of applications addressed by 
the R3051 family and allows designers to implementa single, 
base system and software set capable of accepting a wide 
variety of CPUs according to the price/performance goals of 
the end system. 


THE R3500 RISCore™ CPU/FPA 7 
The R3500 is a single chip that integrates the R3000A CPU 

and the R3010A FPA execution units using the R3000A 
packaging and pinout. This high-integration device is com- 
pletely binary software, compatible with the R3000, R2000 
CPUs and R3010A FPA to facilitate the migration path to 
higher performance and lower chip count systems that utilize 
both the CPU and the floating point units. | 

-IDT has also made several enhancements to the R3000 
architecture such as faster multiply and divide instructions and 
added a programmable tag bus width allowing reduced cache 
cost. The power consumption is lower by 33% when com- 
pared to the standard R3000 and R3010A. 


THE IDT79R4000 CPU 


The R4000 is the third generation of MIPS RISC technology 
and establishes a new performance standard for RISC pro- 
cessors for the 1990s. The R4000 extends the performance 
range served by the MIPS architecture and, thereby, provides 
a migration path to applications served by the R3000, R3001, 
and the R3051. 

This third generation processor maintains full binary com- 
patibility with applications executing on the R2000/R3000 and 
IDT's RISController family, while achieving substantially higher 
performance. The key to this performance is both the archi- 


_tecture/implementation of the processor and the high level of 


integration achieved in a single chip. The R4000 contains the 
RISC integer unit, floating-point unit, MMU, 8K of I- and D- 
cache, along with multiprocessing support such as direct 
control of optional secondary caches. To achieve performance 
levels capable of over 50 VAX MIPS sustained performance, 
the R4000 utilizes technology such as super-pipelining to 
exploit 2-level instruction parallelism with no issue restrictions. 
The R4000 presents a balanced architectural approach to 
achieve a wide range of price/performance goals. | 


able with the R3051 family, by dramatically lowering the cost ~ 
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FEATURES: 


Enhanced instruction set compatible version of the 
IDT79R2000, IDT79R3000 RISC CPUs. 

Upwardly pin-compatible with IDT79R3000 RISC CPU. 
IDT79R3000A “E” version relaxes system memory timing 
requirements in a high-speed systems. 

Full 32-bit Operation—Thirty-two 32-bit registers and all 
instructions and addresses are 32-bit. 

Efficient Pipelining—The CPU's 5-stage pipeline design 
assists in obtaining an execution rate approaching one 
instruction per cycle. Pipeline stalls and exceptions are 
handled precisely and efficiently. 

On-Chip Cache Control—The IDT79R3000A provides a 
high-bandwidth memory interface that handles separate 
external Instruction and Data Caches ranging in size from 
4 to 256kB each. Both caches are accessed during asingle 
CPU cycle. All cache control is on-chip. 

On-Chip Memory Management Unit—A fully-associative, 
64-entry Translation Look-aside Buffer (TLB) provides fast 
address translation for virtual-to-physical memory map- 
ping of the 4gB virtual address space. 


IDT79R3000A PROCESSOR BLOCK DIAGRAM 


CPO 
(System Control Coprocessor) 


Master Pipeline/Bus Control 


aT 


Exception/Control 
Registers 


Memory 
Management 
Unit Registers 


Translation 
Lookaside 
Buffer 
(64 entries) 





TAG (20+4) 


CEMOS is a trademark of Integrated Device Technology, Inc. 
UNIX is a Trademark of AT&T 


COMMERCIAL TEMPERATURE RANGE 


CONTROL 













Virtual Page Number/ 
Virtual Address 


ADDRESS (18) 


Dynamically able to switch between Big- and Little-Endian 
byte ordering conventions. 
Coprocessor Interface—The IDT79R3000A generates all 


addresses and handles memory interface control for up to 


three additional tightly coupled external processors. 
Optimizing Compilers are available for C, Fortran, Pascal, 
COBOL, Ada, PL/1, and C++. 

UNIX™ System V.4 and BSD 4.3 operating systems sup- 
ported. 

High-speed CMOS technology. 

16.7 through 40MHz clock rates yield up to 32VUPS sus- 
tained throughput. | 

Supports independent multi-word block refill of both the 
instruction and data caches with variable block sizes. — 
Supports concurrent refill and execution of instructions. 
Partial word stores executed as read-modify-write opera- 
tions. 

6 external interrupt inputs, 2 software interrupts, with single 
cycle latency to exception handler routine. 

Flexible multiprocessing support on chip with no impact on 
uniprocessor designs. 


CPU 


(32x32) 






Data (3244) 
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DESCRIPTION” 


The IDT79R3000A RISC Microprocessor consists of two 
tightly-coupled processors integrated on a single chip. The 
first processor is a full.32-bit CPU based on RISC (Reduced 


Instruction Set Computer) principles to achieve a new stan- 


dard of microprocessor performance. The second processor 
is a system control coprocessor, called CPQ, containing a 
fully-associative 64-entry TLB (Translation Look-aside Buffer), 
MMU (Memory Management Unit) and control registers, sup- 
porting a 4gB virtual memory subsystem, and a Harvard 
Architecture Cache Controller achieving a bandwidth of 320mB/. 
second using industry standard static RAMs. 

__ This datasheet provides an overview of the features and 
architecture of the IDT79R3000A CPU, Revision 3.0. A more 
detailed description of the operation of the device is incorpo- 
ratedin the /DT79R3000A Family Hardware User Manual, and 
amore detailed architectural overviewis provided in the MIPS 
RISC Architecture book, both available from IDT. Documen- 
tation providing details of the software and development 
environments supporting this. Peer are also available 
from IDT. : ¥ } 


IDT79R3000A CPU Registers 

“The IDT79R3000A CPU provides 32 general- purpose 32- 
bit registers, a 32-bit Program Counter, and two 32-bit regis- 
ters that hold the results of integer multiply and divide opera- 
tions. Only two of the 32 general registers have a special 
purpose: register r0 is hard-wired to the value “0”, which is a 
useful constant, and register r31 is used as the link register in 
~ jump-and-link instructions (return address for subroutine calls). 

The CPU registers are shown in Figure 2. Note that there 
is no Program Status Word (PSW) register shown in this 


figure: the functions traditionally provided by a PSW register — 


are instead provided in the Status and Cause registers incor- 
porated within the System Control eee (CPO). 


Instruction Set Overview 


General Purpose Registers 


0 


oO 


Oo 


io) 


[| 
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Figure 2. IDT79R3000A CPU Registers 
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All IDT79R3000A instructions are 32 bits long, and there. 
are only three instruction formats. This approach simplifies. 
instruction decoding, thus minimizing instruction execution 
time. The IDT79R3000A processor initiates a new:instruction’ 


- on every run cycle, and is able to complete an instruction on 


almost every clock cycle. The only exceptions are the Load 
instructions and Branch instructions, which each havea single 
cycle of latency associated with their execution. Note, how- 
ever, that in the majority of cases the compilers are able to fill 
these latency cycles with useful instructions which: do not 
require the result of the previous:instruction. This effectively 
eliminates these latency effects. | 

' The actual instruction set of the CPU was determined after 
extensive simulations to determine which instructions should 
be implemented in hardware, and which operations are best 
synthesized in software from other basic instructions. This 
methodology resulted in the IDT79R3000A having the highest 
performance of any available microprocessor. 


‘The IDT79R3000A instruction set can be divided into the 
I- “Type (Immediate) | 


3126 25 21 20 16 15 





. J-Type (Jump) 
31 26 25 


a -Type (Register) .. | Mee 
15 11 40 6 5 0 


26 25 21 20 16 
fcvaReeR eR) 
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Figure 3. IDT79R3000A Instruction Formats 


lowing groups: 
Load/Store instructions move data between memory and 

general registers. They are all |-type instructions, since the 
Only addressing mode supported is base register plus 16- 
bit, signed immediate offset. 

_ The Load instruction has a single cycle of latency, which 
means that the data being loaded is not available to the 
instruction immediately after the load instruction. The com- 
piler will fill this delay slot with either an instruction which is 
not dependent on the loaded data, or with a NOP instruc- 
tion. There is no latency associated with the Store instruc- 

tion. 

Loads and Stores can be performed on byte, ‘half-word, 

_word, or unaligned word data (32-bit data not aligned ona 
modulo-4 address). The CPU cache is constructed as a 
write-through cache. 

Computational instructions perform arithmetic, logical 
and shift operations on values in registers. They occur in 
both R-type (both operands and the result are registers) 
and |-type (one operand is a 16-bit immediate) formats. 
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Note that computational instructions are three operand 
instructions; that is, the result of the operation can be 
stored into a different register than either of the two 
operands. This means that operands need not be overwrit- 
ten by arithmetic operations. This results in a more efficient 
use of the large register set. 

Jump and Branch instructions change the control flow of 
a program. Jumps are always to a paged absolute address 
formed by combining a 26-bit target with four bits of the 
Program counter (J-type format, for subroutine calls), or 
32-bit register byte addresses (R-type, for returns and 
dispatches). Branches have 16-bit offsets relative to the 
program counter (I-type). Jump and Link instructions save 
a return address in Register 31. The IDT79R3000A in- 
struction set features a number of branch conditions. 
Included is the ability to compare a register to zero and 
branch, and also the ability to branch based on a compari- 
son between two registers. Thus, net performance is 
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increased since software does not have to perform arith- 
metic instructions prior to the branch to set up the branch: 
conditions. 

Coprocessor instructions perform operations in the 
coprocessors. Coprocessor Loads and Stores are I-type. 
Coprocessor computational instructions have coprocessor- 
dependent formats (See coprocessor manuals). 
Coprocessor 0 instructions perform operations on the 
System Control Coprocessor (CPO) registers to manipu- 
late the memory management and exception handling 
facilities of the processor. 

Special instructions perform a variety of tasks, including 
movement of data between special and general registers, 
system calls, and breakpoint. They are always R-type. 


Table 1 lists the instruction set of the IDT79R3000A 


processor. 
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Description  — 


Load/Store Instructions 
Load Byte . 2? 
Load Byte Unsigned 


_ Load Halfword 
|. Load Halfword Unsigned 


Load Word 

Load Word Left 
Load Word Right 
Store Byte 

Store Halfword 
Store Word 

Store Word Left 
Store Word Right 


Arithmetic Instructions 
(ALU Immediate) 

Add Immediate 

Add Immediate Unsigned 
Set on Less Than Immediate 
Set on Less Than Immediate 
Unsigned 

AND Immediate 

OR Immediate 

Exclusive OR Immediate 
Load Upper Immediate 


Arithmetic instructions 
(3-operand, register-type) 
Add , 

Add Unsigned 

Subtract 

Subtract Unsigned 

Set on Less Than 


Set on Less Than Unsigned 
AND 

OR 

Exclusive OR 

NOR 


Shift Instructions 

Shift Left Logical 

Shift Right Logical 

Shift Right Arithmetic 

Shift Left Logical Variable 
Shift Right Logical Variable 


Shift Right Arithmetic Variable 
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JALR 
BEQ 
BNE 
BLEZ 
BGTZ 
BLTZ 
BGEZ 


BLTZAL 
BGEZAL 


SYSCALL 
BREAK 


LWCz 
SWCz 
MTCz 
MFCz 
CTCz 
CFCz 
COPz 
BCzT 
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. Description 


Multiply/Divide Instructions 


Multiply 


_ Multiply Unsigned | 


Divide | 
Divide Unsigned 
Move From HI 
Move To HI 

Move From LO 
Move To LO 


Jump and Branch Instructions 
Jump 


Jump and Link 


Jump to Register 

Jump and Link Register 

Branch on Equal 

Branch on Not Equal 

Branch on Less than or Equal to Zero 
Branch on Greater Than Zero 
Branch on Less Than Zero 

Branch on Greater than or 

Equal to Zero 

Branch on Less Than Zero and Link 
Branch on Greater than or Equal to 
Zero and Link 

Special Instructions 

System Call 

Break 


Coprocessor Instructions 
Load Word from Coprocessor 
Store Word to Coprocessor 
Move To Coprocessor 

Move From Coprocessor 

Move Control to Coprocessor 
Move Control From Coprocessor 
Coprocessor Operation 

Branch on Coprocessor z True 
Branch on Coprocessor z False 


System Control Coprocessor 
(CPO) Instructions 

Move To CPo 

Move From CPo 

Read indexed TLB entry 

Write Indexed TLB entry 

Write Random TLB entry 
Probe TLB for matching entry 
Restore From Exception 
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IDT79R3000A System Control Coprocessor (CPO) 

The IDT79R3000A can operate with up to four tightly- 
coupled coprocessors (designated CPO through CP3). The 
System Control Coprocessor (or CPO), is incorporated on the 
IDT79R3000 chip and supports the virtual memory system 
and exception handling functions of the IDT79R3000A. The 
virtual memory system is implemented using a Translation 
Look-aside Buffer and a group of programmable registers as 
shown in Figure 4. 


SYSTEM CONTROL COPROCESSOR (CP0) 
REGISTERS 

The CPO registers shown in Figure 4 are used to control 
the memory management and exception handling capabilities 
of the IDT79R3000A. Table 2 provides a brief description of 
each register. 





ENTRYHI | ENTRYLO 


63 





NOT ACCESSED 
BY RANDOM 
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SYSTEM CONTROL COPROCESSOR (CPO) 
REGISTERS 


EntryHi 
EntryLo 
Index 
Random 


Status 


Cause 
EPC 
Context 
BadVA 


PRid 





HIGH half of a TLB entry 

LOW half of a TLB entry 
Programmable pointer into TLB array 
Pseudo-random pointer into TLB array 


Mode, interrupt enables, and diagnostic status info 
Indicates nature of last exception 

Exception Program Counter 

Pointer into kernel’s virtual Page Table Entry array 
Most recent bad virtual address 


Processor revision identification (Read only) 
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RANDOM 


[_] Used with Virtual Memory System 
Used with Exception Processing 
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Figure 4. The System Coprocessor Registers 
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Memory Management System 

The IDT79R3000A has an sadisene range of: 4gB. 
However, since most IDT79R3000A systems implement a 
physical memory smaller than 4gB, the IDT79R3000A pro- 
vides for the logical expansion of memory space by translating 
addresses composed in a large virtual address space into 
available physical memory address. The 4gB address space 
is divided into 2gB which can be accessed by both the users 
and the kernel, and 2gB for the kernel only. 


The TLB (Translation Lookaside Buffer) 

| Virtual memory mapping is assisted by the Translation 
Lookaside Buffer (TLB). The on-chip TLB provides very fast 

virtual memory access and is well-matched to the require- 

ments of multi-tasking operating systems. The fully-associa- 
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tive TLB contains 64 entries, each.of which maps a 4kB page, 
with controls for read/write access, cacheability, and process 
identification. The TLB allows each user to access up to 2gB 
of virtual address space. 

Figure 5 illustrates the format of each TLB ening The 
Translation operation involves matching the current Process 
ID (PID) and upper 20 bits of the address against PID and VPN 
(Virtual Page Number) fields in the TLB. When both match (or 
the TLB entry is Global), the VPN is replaced with the PFN 
(Physical Frame Number) to form the physical address. 

TLB misses are handled in software, with the entry to be re- 
placed determined by as imple RANDOM function. The rou- 
tine to process a TLB miss in the UNIX environment requires 
only 10-12 cycles, which compares favorably with many CPUs 
which perform the operation in hardware. 
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ENTRYHI 


ENTRYLO 


VPN — Virtual Page Number 
TLBPID — Process ID 
PFN — Physical Frame Number 
A Non-cacheable flag 

— Dirty flag (Write protect) 


vo Valid entry 


flag 


G — Global flag {ignore PID) 


0 — Reserved 
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Figure 5. TLB Entry Format 


IDT79R3000 Operating Modes 

The IDT79R3000A has two operating modes: User mode 
and Kernel/mode. The IDT79R3000A normally operates in 
the User mode until an exception is detected forcing it into the 


Kernel mode. It remains in the Kernel mode until a Restore | 


From Exception (RFE) instruction is executed. The manner in 
which memory addresses are translated or mapped depends 
on the operating mode of the IDT79R3000A (Figure 6) shows 
the MMU translation performed for each of the operating 
modes. 


IDT79R3000A/AE RISC CPU PROCESSOR 


OxFFFFFFFF 
KERNEL 
MAPPED 
CACHEABLE 
(kseg2) 
0xC0000000 


KERNEL 
UNMAPPED 
UNCACHEED 
(kseg1) 
0xA0000000 


KERNEL 
UNMAPPED 
CACHED 
(kseg0) 
0x80000000 


0x7 FFFFFFFF 


KERNEL/USER 
MAPPED 
CACHEABLE 
(kuseg) 
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OxFFFFFFFF 


PHYSICAL 


0x20000000 
0x1 FFEFFFF 
MEMORY 512 MB 


0x00000000 
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Figure 6. IDT79R3000A Virtual Address Mapping 


User Mode—in this mode, a single, uniform virtual ad- 
dress space (kuseg) of 2gB is available. Each virtual address 
is extended with a 6-bit process identifier field to form unique 
virtual addresses. All references to this segment are mapped 
through the TLB. Use of the cache for up to 64 processes is 
determined by bit settings for each page within the TLB 
entries. 

Kernel Mode—four separate segments are defined in 
this mode: 

* kuseg—when in the kernel mode, references to this seg- 
ment are treated just like user mode references, thus 
streamlining kernel access to user data. 

* ksegO—references to this 512mB segment use cache 
memory but are not mapped through the TLB. Instead, they 
always map to the first 0.5gB of physical address space. 

* ksegi—references to this 512mB segment are not mapped 
through the TLB and do not use the cache. Instead, they 
are hard-mapped into the same 0.5gB segment of physical 
address space as kseg0. 

* kseg2—references to this 1gB segment are always mapped 
through the TLB and use of the cache is determined by bit 
settings within the TLB entries. 


IDT79R3000 Pipeline Architecture 

The execution of a single IDT79R3000A instruction con- 

sists of five primary steps: 

1) IF |©— Fetch the instruction (l-Cache). 

2) RD — Read any required operands from CPU 
registers while decoding the instruction. 

3) ALU — Perform the required operation on 
instruction operands. 

4) MEM— Access memory (D-Cache). 

5) WB — Write back results to register file. 

Each of these steps requires approximately one CPU 
cycle, as shown in Figure 7 (parts of some operations overlap 
into another cycle while other operations require only 1/2 
cycle). 





F RD ALU | MEM WB 
Treas [RE] oF [crore | wa 
eee | 


One Cycle 
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Figure 7. IDT79R3000A Instruction Pipeline 
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INSTRUCTION EXECUTION 


The IDT79R3000A uses a 5-stage pipeline to achieve an 
instruction execution rate approaching one instruction per 28 
CPU cycle. Thus, execution of five instructions ata time are ~ a Data Address 
overlapped as shown in Figure 8. | | 





Microprocessor 
. (CPU) | 


(5-Deep) 


TF [ao [ato ew 


Memory 
(and 1/O) 










we 
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Figure 9. A Simple Microprocessor Memory System 


on 


Instruction 
Flow 


Figure 10 illustrates a memory system that supports the 
significantly greater memory bandwidth required to take full 
advantage of the IDT79R3000A's performance capabilities. 
The key features of this system are: 
¢ External Cache Memory—Local, high-speed memory 

(called cache memory) is used to hold instructions and data 
















wie | mo [ALU] WeN We 





atte that is repetitively accessed by the CPU (for example, 
Cycle within a program loop) and thus reduces the number of 


ee Pi references that must be made to the slower-speed main 
Figure 8. IDT79R3000A Execution Sequence | 










Ba ate iy Seat oe een a IDT79R3000A 
This pipeline operates efficiently because different CPU Microprocessor 
resources (address and data bus accesses, ALU operations, 
register accesses, and so on) are utilized on a non-interfering 


basis. 


Address 


= Instruction 
Data 


Cache 


Memory System Hierarchy os 

The high performance capabilities of the IDT79R3000A 
processor demand system configurations incorporating tech- 
niques frequently employed in large, mainframe computers 
but seldom encountered in systems based on more traditional 
microprocessors. ee eels | 
- Aprimary goal of systems employing RISC techniques is 
to minimize the average number of cycles each instruction 
requires for execution. In order to achieve this goal, RISC 
processors incorporate a number of RISC techniques, includ- 
ing a compact and uniform instruction set, a deep instruction 
pipeline (as described above), and utilization of optimizing 
compilers. Many of the advantages obtained from these 
techniques can, however, be negated by an inefficient memory 
system. | 

Figure 9 illustrates memory in a simple microprocessor 
system. In this system, the CPU outputs addresses to memory 2860 drw 10 
and reads instructions and data from memory or writes data to Figure 10. An IDT79R3000A System with a 
memory. The address space is completely undifferentiated: High-Performance Memory System 
instructions, data, and I/O devices are all treated the same. In 
such asystem, a primary limiting performance factor is memory 
bandwidth. 
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memory. Some microprocessors provide a limited amount 
of cache memory on the CPU chip itself. The external 
caches supported by the IDT79R3000A can be much 
larger; while a small cache can improve performance of 
some programs, significant improvements for a wide range 
of programs require large caches. 

Separate Caches for data and Instructions—Even with 
high-speed caches, memory speed can still be a limiting 
factor because of the fast cycle time of a high-performance 
microprocessor. The IDT79R3000A supports separate 
caches for instructions and data and alternates accesses 
of the two caches during each CPU cycle. Thus, the 
processor can obtain data and instructions at the cycle rate 
of the CPU using caches constructed with commercially 
available IDT static RAM devices. 

In order to maximize bandwidth in the cache while minimiz- 
ing the requirement for SRAM access speed, the 
IDT79R3000A divides a single-processor clock cycle into 
two phases. During one phase, the address for the data 
cache access is presented while data previously addressed 
in the instruction cache is read; during the next phase, the 
data operation is completed while the instruction cache is 
being addressed. Thus, both caches are read in a single 
processor cycle using only one set of address and data 
pins. 


Write Buffer—in order to ensure data consistency, all data 
that is written to the data cache must also be written out to 
main memory. The cache write model used by the 
IDT79R3000A is that of a write-through cache; that is, all 
data written by the CPU is immediately written into the main 
memory. Torelieve the CPU of this responsibility (and the 
inherent performance burden) the IDT79R3000A supports 
an interface to a write buffer. The IDT79R3020 Write Buffer 
captures data (and associated addresses) output by the 
CPU and ensures that the data is passed on to main 
memory. 


IDT79R3000A Processor Subsystem Interfaces 


Figure 11 illustrates the three subsystem interfaces pro- 


vided by the IDT79R3000A processor: 


Cache control interface (on-chip) for separate data and 
instruction caches permits implementation of off-chip caches 
using standard IDT SRAM devices. The IDT79R3000A 
directly controls the cache memory with a minimum of 
external components. Both the instruction and data cache 


can vary from 0 to 256kB (64K entries). The IDT79R3000A » 


also includes the TAG control logic which determines 
whether or not the entry read from the cache is the desired 
data. The IDT79R3000A cache controller implements a 
direct mapped cache for high net performance (band- 
width). It has the ability to refill multiple words when a cache 
miss occurs, thus reducing the effective miss rate to less 
than 2% for large caches. When a cache miss occurs, the 
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IDT79R3000A can support refilling the cache in 1, 4, 8, 16, 
or 32-word blocks to minimize the effective penalty of 
having to access main memory. The IDT79R3000A also 
incorporates the ability to perform instruction streaming; 
while the cache is refilling, the processor can resume 
execution once the missed word is obtained from main 
memory. In this way, the processor can continue to execute 
concurrently with the cache block refill. 

Memory controller interface for system (main) memory. 
This interface also includes the logic and signals to allow 
operation with a write buffer to further improve memory 
bandwidth. In addition to the standard full word access, the 
memory controller supports the ability to write bytes and 
half-words by using partial word operations. The memory 
controller also supports the ability to retry memory ac- 
cesses if, for example, the data returned from memory is 
invalid and a bus error needs to be signalled. 
Coprocessor Interface—The IDT79R3000A features a 
tightly coupled co-processor interface in which all co- 
processors maintain synchronization with the main pro- 
cessor; reside on the same data bus as the main proces- 
sor; and participate in bus transactions in an identical 
manner to the main processor. The IDT79R3000A gener- 
ates all required cache and memory control signals, includ- 
ing cache and memory addresses for attached 
coprocessors. As a result, only the data bus and a few 
control signals need to be connected to a coprocessor. 
The interface supports three types of coprocessor instruc- 
tions: loads/stores, coprocessor operations, and proces- 
sor-coprocessor transfers. Note that coprocessor loads 
and stores occur directly between the coprocessor and 
memory, without requiring the data to go through the CPU. 
Synchronization between the CPU and external 
coprocessors |s achieved using a Phased-Lock Loop inter- 
face to the coprocessor. The coprocessor physical inter- 
face also includes coprocessor condition signals 
(CpCond(n)), which are used in coprocessor branch in- 
structions, and a coprocessor busy signal (CpBusy) which 
is used to stall the CPU if the coprocessor needs to hold off 
subsequent operations. 

Finally, a precise exception interface is defined between 
the CPU and coprocessors using the external interrupt 
inputs of the CPU. This allows a coprocessor exception, 
even if it was the result of a multi-cycle operation, to be 
traced to the precise coprocessor operation which caused 
it. This is an important feature for languages which can 
define specific error handlers for each task. © 
The interface supports up to four separate coprocessors. 
Coprocessor 0 is defined to be the system control 
coprocessor, and resides on the same chip as the CPU 
unit. Coprocessor 1 is the Floating Point Accelerator, 
IDT79R3010A. Coprocessors 2 and 3 are available to 
support an interface to application specific functions. 
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MULTIPROCESSING SUPPORT - 


The IDT79R3000A supports multiprocessing applications 
in a simple but effective way. Multiprocessing applications 
require cache coherency across the multiple processors. The 
IDT79R3000A offers two signals to support cache coherency: 
the first, MPStall, stalls the processor within two cycles of 
being received and keeps it from accessing the cache. This 
allows an external agent to snoop into the processor data 
cache. The second signal, MPInvalidate, causes the proces- 
sor to write data on the data cache bus which indicates the 
externally addressed cache entry is invalid. Thus, a subse- 
quent access to that location would result ina cache miss, and 
the data would be obtained from main memory. 

The two MP signals would be generated by a external 
logic which utilizes a secondary cache to perform bus snoop- 
ing functions. The IDT79R3000A does not impose an archi- 
tecture for this secondary cache, but rather is flexible enough 
to support a variety of application specific architecture stand 
still maintain cache coherency. Further, there is no impact on 
designs which do not require this feature. The IDT79R3000A 
has further improved on the microprocessor support found in 
the IDT79R3000, by allowing the use of cache RAMs with 
internal address latches in multiprocessor systems. 


ADVANCED FEATURES . 
The IDT79R3000A offers a number of additional features 


~ such as the ability to swap the instruction and data caches, 
_. facilitating diagnostics and cache flushing. Another feature 
“~ isolates the caches, which forces cache hits to occur regard- 


less of the contents of the tag fields. The IDT79R3000A allows 
the processor to execute user tasks of the opposite byte 
ordering (endianness) of the operating system, and further 
allows parity checking to be disabled. More details on these 
features can be found in the IDT79R3000A Family Hardware 
User's Manual. 

Further features of the IDT79R3000A are configured 
during the last four cycles prior to.the negation of the RESET 
input. These functions include the ability to select cache sizes 
and cache refill block sizes; the ability to utilize the multipro- 
cessor interface; whether or not instruction streaming is 
enabled; whether byte ordering follows “Big-Endian” or “Little- 
Endian” protocols, etc. Table 3 shows the configuration op- 
tions selected at Reset. These are further apcusSeS! in the 
Hardware User’s Manual. . 


BACKWARD COMPATIBILITY WITH 7 
IDT79R2000 


‘The IDT79R3000A can be used in sockets designed for 
the IDT79R3000. The pin-out of the IDT79R3000A has been 
selected to ensure this compatibility, with new functions 
mapped onto previously unused pins. The instruction set is 
compatible with that of the IDT79R2000 at the binary level. As 
a result, code written for the older processor can be executed. 
New features can be selectively disabled. 


_ COMMERCIAL TEMPERATURE RANGE. 


In most |DT79R3000 applications, the IDT79R3000A can 
be placed in the socket with no modification to initialization 
settings. Further application assistance on this topic.is avail- 
able dl IDT. | 


PACKAGE THERMAL SPECIFICATIONS: 


~ The IDT79R3000A utilizes special packaging techniques 
to improve both the thermal and electrical characteristics of 
the microprocessor. 

In order to improve the electrical characteristics of the 
device, the package is constructed using multiple signal 
planes, including individual power planes and ground planes 
to reduce noise associated with high-frequency TTL parts. In 
addition, the 175-pin PGA package utilizes extra power and 
ground pins to reduce the inductance from the internal power 
planes to the power planes of the PC Board. 

In order to improve the electrical characteristics of the 
microprocessor, the device is housed using cavity down 
packaging. In addition, these packages incorporate a copper- 
tungsten thermal slug designed to efficiently transfer heat 
from the die to the case of the package, and thus effectively 
lower the thermal resistance of the package. The use of an 
additional external heat sink affixed to the package thermal 
slug further decreases the effective thermal resistance of the 
package. | 

The case temperature may be measured in any environ- 
ment to determine whether the device is within the specified 
operating range. The case temperature should be measured 
at the center of the top surface opposite the package cavity 
(the package cavity is the ‘side where the package lid is 
mounted). 

The equivalent allowable ambient temperature, TA, can 
be calculated using the thermal resistance from case to 
ambient (0ca) for the given package. The following equation 
relates ambient and case temperature: 


TA = Tc - P*@ca 


where P is the maximum power consumption, calculated dey 
using the maximum loc from the DC Electrical Characteristics 
section. 
_ Typical values for @ca at various airflows are shown in 
table 4 for various CPU packages. 


R3000A PACKAGE CHARACTERISTICS 













- Airflow - (ft/min) : 
[200 | 400 | 600 | 800 | 1000} 
Oca eam 21 7 3 pete 0.5 
144-PGA) | 
@ca (172 Quad 23 4. 3° | 2.5 1.5 
Flatpack) 7 


2860 tbl 03 








IDT79R3000A/AE RISC CPU PROCESSOR COMMERCIAL TEMPERATURE RANGE 


R3000A MODE SELECTABLE FEATURES 


| Input —_| _ WCycle X Cycle Y Cycle Z Cycle 















































Into DBIkSize0 DBikSize1 Extend Cache Big Endian 

Intt IBIkSize0 IBIkSize1 MPAdrDisable TriState 

Int2 DispPar/RevEnd IStream IgnoreParity NoCache 

Int3 Reserved(’) StorePartial MultiProcessor BusDriveOn 

Int4 PhaseDelayOn'2) PhaseDelayOn(2) PhaseDelayOn'@) PhaseDelayOn'@) 
int5 R3000 Model?) R3000 Model?) R3000 Model?) R3000 Model?) 


NOTES: 2860 tbl 04 
1. Reserved entries must be driven high. 
2. These values must be driven stable throughout the enfire RESET period. 


Data Bus Data Bus 





Data Bus 

















W 
| 
as) 


AdrLo Bus AdrLo Bus 
ef AdrLo Data 
Tag DataP 
TagP 
Trans- 
IClk DCik parent 
Latch 







IDT79R3000A Processor 
with System Control 
Coprocessor 

















Data Tag |Adr 
(15:2) 


lAdr Tag Data 
(15:2) 
















__ Data 
OE Cache 


Instruction 
Cache 





Clk2xSys 







Clk2Rd 
sysOut Clk2xPhi 
Reset 








Memory 
Interface 











CpCond(3:1 
Int(5:0 









Hardware 
Interrupts 
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Figure 11. IDT79R3000A Subsystem Interfaces Example; 64 KB Caches 
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IDT79R3000A/AE RISC CPU PROCESSOR 

















_ COMMERCIAL TEMPERATURE RANGE 








PIN CONFIGURATION | - 
CONMOROTMNNS YS To) Z. Ko) oe a o~ 
Nr ON errr eer er oh -. = CTHKHONMDONTMANRAYrYOOdO 
SXSSGCKHKZKKSFGFOOOSOOOCOOOOOOOOAZSSESHSSHSSSSHStd 
THGTTCTGCTCTDTT TC GZOO G®ZOOOZZZOOZZZVIGCGCGVHSCKHHTHGO 
AN NDONODONOONOOONDOS FSO OS>S>>00 0S S00 070000000 00000ct< 
44 86 
Data21 43 87 -— AdrLo2 
Data22 C4 L—] AdrLo3 
Data24 Co -— 1 AdrLo4 
Data25 C4 ——! AdrLod 
Data26 L_— -—I AdrLo6 
Data31 C— -—! AdrLo7 
DataP3 L-— -—! AdrLo8 
Data27 -— -—I AdrLo9 
Data28 -— -—] AdrLo10 
XEn C4 -—} AdrLoi1 
Data29 C— L—} AdrLoi2 
Data30 C—~ -—— AdrLo13 
Exc C= -—] AdrLo14 
Clk2xPhi -— L——1 VCC 
GND -—1 VCC 
GND C— L— VCC 
Clk2xSmp CS +—! GND 
vcc -—] GND 
vec C— -—1 VCC 
GND C= -—1 VCC 
GND C— -——1 GND 
GND C4 -—— VCC 
vcc -—! VCC 
| VCC ——t VCC 
i. vcc 1 -—— AdrLo1i5 
( GND — -—1 CpCondd 
| GND pd CoCond1 
| Cik2xSys + -—1 Resvdt 
| IRd1 + — + GND 
DRd1 -—! GND 
Wri -———I AdrlLo16 
DWri -— -——! AdrLo17 
VCC -—! Into 
vec -— Inti 
Clk2xRd C4 r—I Int2 
SysOut -— -—! Int3 
DCik C— -—! Int4 
JCik b— -— Intd 
IRd2 -—1 CoBusy 
DRd2 C— -—1 WrBusy 
IWr2 oS —— RdBusy 
DWr2 -— ——_ BusError 
MemWr — 1 129 +— Reset 
fro TOUUUUUUUUUU OOOO 
ES Batons SVBIOILLS ke Oe 8 VOODOO ON A geenee eran : 
EG ERS SP PRP LOG PI> PUGS E> VOSGSSSVIPPPISP PIP — secoawiz 
®QiagdQ 899 RREEFEF Fre FG a FRR RRRREERFREEE ; 
=> O oo°0 ® 
<tIat< or 
NOTES: 
1. Reserved pins must not be connected. 
2. AdrLo 16 and 17 are multifunction pins which are controlled by mode select programming on interrupt pins at reset time 
AdrLo 16: MP Invalidate, CpCond (2). 
AdrLo 17: MP Stall, CpCond (3). 
; 172-Pin Flatpack (Top View) 
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IDT79R3000A/AE RISC CPU PROCESSOR COMMERCIAL TEMPERATURE RANGE 


PIN aaa ttaa 
Reset 


Tag12 | Tag15 


oo 
=z 
Zo 


Tagi3 | TagP0 | Tag18 


ND | Tagi4 Tag19 


VCC | Tagi6 | Tag20} VCC 


VCC | GND | Tag22 | TagP1 


< 
© 


GND Tag25 | Tag24 
Tag28 | Tag29 | Tag26 





VCC 


c 
< 


> 
GQ 
O° 





NOTE: 2860 drw 13 


1. AdrLo 16 and 17 are multifunction pins which are controlled by mode select programming on interrupt pins at reset time 
AdrLo 16: MP Invalidate, CpCond (2). 
AdrLo 17: MP Stall, CpCond (3). 


175-Pin PGA (Top View) 


5.1 13 


IDT79R3000A/AE RISC CPU PROCESSOR 


PIN dete 
1 


cc 


pale DAd2 oa Ad sc a sg CpCond) 
ae VCC AdtLo | GND | GND | VCC 
ae oa 


wo 


O 
on 


oO fp O 


oe oar Data | GND 
19 
vee Data Data | Data 
22 26 
a Data Data | Data 


aon 





.COMMERCIAL TEMPERATURE RANGE 


nae ate 


Tagi7 Tage 


Tagi6 Tag20 | VCC 


GN 


Oo 


Tag21 | Tag23 


Tag22 TagP1 


VC 


Q) 


Tag 25 | Tag24 


Tag28 | Tag29 | Tag26 


GND Tag27 


> 
oO 
OQ 
cae | 
pee) 
© 
Ww 
— oe 


Ko} 
< i@) 


| 

< 

T 
No 


@ 

Zz 

O 
= 
<> 
28 


= 
Dio 
O15 
g| 
Cc 
5 


s 
se 
> 
[@) 

ced 
> 

;< 
38 
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. AdrLo 16 and 17 are multifunction pins which are controlled by mode select programming on interrupt pins at reset time. 


AdrLo 16: MP Invalidate, CpCond (2). 
AdrLo 17: MP Stall, CpCond (3). 


144-Pin PGA (Top View) 
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IDT79R3000A/AE RISC CPU PROCESSOR COMMERCIAL TEMPERATURE RANGE 





a 
z 
s) 
Mm 
7) 
e) 
a 
U 
4 
O 
z 
7) 


Pin Name Vo Description 


/O 


Data (0-31) 


A 32-bit bus used for all instruction and data transmission among the processor, caches, memory interface, 
and coprocessors. 





DataP (0-3) /O | A 4-bit bus containing even parity over the data bus. 


(12-31) | V/O | A 20-bit bus used for transferring cache tags and high addresses between the processor, caches, and memory 
interface. 


I/ The tag validity indicator. 


A 3-bit bus containing even parity over the concatenation of TagV and Tag. 


An 18-bit bus containing byte addresses used for transferring low addresses from the processor to the caches 
and memory interface. (AdrLo 16: CpCond (2), AdrLo 17: CpCond (8) set by reset initialization). 


Read enabie for the instruction cache. 


4 
pe) 
ca 


= 
ped) 
a 
; 


(or 
ine) 






(0-2) 
(0-17) 





| >|o4 
Q |» 
= 
= {ca 
oj}, Uv 





| D 
Q 


z 


Write enable for the instructon cache. 
An identical copy of IRd1 used to split the load. 


z 
N 


An identical copy of lWr1 used to split the load. 






The instruction cache address latch clock. This clock runs continuously. 


The read enable for the data cache. 


The write enable for the data cache. 


Ol S 
aS 


An identical copy of DRd1 used to split the load. 
An identical copy of DWr1 used to split the load. 








The data cache address latch clock. This clock runs continuously. 
Ord The read enable for the Read Buffer. 


A 3-bit bus used to indicate the size of data being transferred on the data bus, whether or not a data transfer is 
occurring, and the purpose of the transfer. 


AccTyp(0-2) 





Signals the occurrence of a main memory write. 


MemRd Signals the occurrence of a main memory read. 
BusError Signals the occurrence of a bus error during a main memory read or write. 
Run 


Indicates whether the processor is in the run or stall state. 


| SysOut_ | 
_Spsync | 
p 








ndicates that the instruction about to commit state should be aborted and other exception related information. 





A reflection of the internal processor clock used to generate the system clock. 





A clock which is identical to SysOut and used by coprocessors for timing synchronization with the CPU. 


The main memory read stall termination signal. In most system designs RdBusy is normally asserted and is 
deasserted only to indicate the successful completion of a memory read. RdBusy is sampled by the processor 
only during memory read stalls. 


RdBusy 


sOut 
sync 
(0-5) 





The main memory write stall initiation/termination signal. 
The coprocessor busy stall initiation/termination signal. 
CpCond (0-1 


1) 


A 2-bit bus used to transfer conditional branch status from the coprocessors to the main processor. 


Conditional branch status from coprocessors to the processor. Function is provided on AdrLo 16/17 pins and is 
selected at reset time. 


Multiprocessing Stall. Signals to the processor that it should stall accesses to the caches in a multiprocessing 
environment. This is physically the same pin as CpCond3; its use is determined at RESET initialization. 


MPStall 
MPInvalidate 


Int 


Multiprocessing Invalidate. Signals to the processor that it should issue invalidate data on the cache data bus. 
The address to be invalidated is externally provided. This is the same pin as CpCond2; its use is determined at 
RESET initialization. 


A 6-bit bus used by the memory interface and coprocessors to signal maskable interrupts to the processor. At 
reset time, mode select values are read in. 
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IDT79R3000A/AE RISC CPU PROCESSOR 


PIN DESCRIPTIONS (Continued) 















leading edge of SysOut. 


ABSOLUTE MAXIMUM RATINGS": 9) 
[Symbol] Rating _| Commercial, 


Terminal Voltage | —0.5 to +7.0 
with Respect 
















0 to +704 
(Ambient) 


0 to +90!) 


Operating 
Temperature 









Under Bias 0 to +90) 
Temperature - 





Input Voltage -0.5 to +7.0 





NOTE: 


1. Stresses greater than those listed under ABSOLUTE MAXIMUM RATINGS | 


may cause permanent damage to the device. This is a stress rating only 
and functional operation of the device at these or any other conditions 


above those indicated in the operational sections of this specificationis not 
implied. Exposure to absolute maximum rating conditions for extended — 


periods may affect reliability. 
2. -ViNminimum = —3.0V for pulse width less than 15ns. 
Vin should not exceed Vcc +0.5 Volts. 
3. Notmore than one outputshould be shorted ata time. Duration of the short 
should not exceed 30 seconds. 
. 16-33 MHz only. 
. 37-40 MHz only. 


ak 


AC TEST CONDITIONS 


Pv | Input HIGH Voltage 
sd Input LOW Voltage 


Input HIGH Voltage 
Input LOW Voltage . 
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pees [Fee 





| ClkaxSys | | The master double frequency input clock used for generating SysOut. 

Clk2xSmp A double frequency clock input used to determine the salle point for data eormng into the processor ane 
coprocessors. on | 

| Clk2xRd | | Cik2xRd | st A double frequency clock input’ used to determine the enable time of the cache RAMs. 


Clk2xPhi Tr A double frequency clock input used to determine the position of the internal phases, phase1 and phase2. 


Synchronous initialization input used to force execution starting from the reset memory address. Reset must be 
deasseted synchronously but asserted asynchronously. The deassertion of Reset must be synchronized by the 
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COMMERCIAL TEMPERATURE RANGE 
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RECOMMENDED OPERATING 


TEMPERATURE AND SUPPLY VOLTAGE 











[Grade | Temperature | GND | Veo 
Commercial 0°C to +70°C OV 5.0 +5% 
16-33 MHz (Ambient) 

Commercial — 0°C to +90°C ov 5.0 +5% 
40 MHz (Case) 
a 2860 tbl 09 


OUTPUT LOADING FOR AC TESTING 


To Device 
Under Test. 
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IDT79R3000A/AE RISC CPU PROCESSOR COMMERCIAL TEMPERATURE RANGE 


DC ELECTRICAL CHARACTERISTICS 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V +5%) 


79R3000A 79R3000AE 
Symbol Parameter Test Conditions 


16.67MHz | 20.0MHz | 25.0MHz_ | 33.33MHz_| 
Output HIGH Voltage Vcc = Min., IOH = -4mA 
Output LOW Voltage Vcc = Min., lo. = 4mA 
1H 
VIL 
CIN 
















=r 
Voc = Min. lox = -4mA 
[Vout __| Output LOW vottagel#®) | Vcc= Min, Iou=8ma | — | 
[Vin [inputHiGHVottage®) | | 20 
[Vi | Input owvottage() | 
Input HIGH Voltage(2,5) a 
inputLOW Voltage) fo 
[cw | Input Capacitance | 
Cour | Output Capacitancel®) | 
icc___| Operating Gurent__—| Voo=5V,Ta=70°C | 
Fic [np tow teatage®™) [vi eno eof — 
Output Tri-state Leakage —100 | 100 } -100 
NOTES: 


1. Vit Min. = —3.0V for pulse width less than 15ns. Vit should not fall below —0.5 Volts for larger periods. 

2. ViHs and ViLs apply to Clk2xSys, Clk2xSmp, Cik2xRd, Clk2xPhi, CpBusy, and Reset. 

3. These parameters do not apply to the clock inputs. 

4. VoHTand VoLtT apply to the bidirectional data and tag busses only. Note that ViH and Vit also apply to these signals. VoHT and VoLT are provided 
to give the designer further information about these specific signals. 

ViH should not be held above Vcc + 0.5 volts. 

Guaranteed by design. 

VoHc applies to RUN and Exception. 
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: VoLT Output LOW Voltage “Vec = Min., lol = 8mA | 


IDT79R3000A/AE RISC CPU PROCESSOR ae COMMERCIAL TEMPERATURE RANGE 


DC ELECTRICAL CHARACTERISTICS Lae yet EF wp Be fe 
COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc = +5.0V 45%) | 
a | 79R3000AE 























' -. Parameter 





| Test Conditions 
Vcc = Min., loH = -4mA 


Output HIGH Voltage 


= 
= | 
Cc 
=. 
~~ 





Output LOW Voltage Vcc = Min., lol = 4mA 
VoHc | Output HIGH Voltage” Vcc = Min., lok = -4mA. 


VOHT 





Output HIGH Voltage(4#®) | Vcc = Min., OH =-8mA 


nm | a wo| = 
R/O on i 


oe) 


2 
3 






oe) 


= oe 


COUT Output Capacitance 
Operating Current Vec=5V,TA=70°C © 


850 © mA 
100 | pA 


as 


Input HIGH Leakage) ViH=VCC : 
Input LOW Leakage’? Vil = GND | —100 7 
Output Tri-state Leakage | VOH = VCC, VoL = GND ~ -100 7 100, LA 


2860 tbl 12 


VoH | 
VOL | 
CS) 
ce 
tL 
ue 
2) 
IN j (6) 
(6) 
Ico | 
HH ) 
IL ) 
OZ. a | 





© 





NOTES: 


1. Vit Min. = —3.0V for pulse width less than 15ns. Vi should not fall below —0.5 Volts for larger periods. —_- 

2. ViHs and VILs apply to Clk2xSys, Clk2xSmp, Clk2xRd, Clk2xPhi, CpBusy, and Reset. 

3. These parameters do not apply to the clock inputs. i ; 

4. VoHTand VOLT apply to the bidirectional data and tag busses only. Note that ViH and Vit also apply to these signals. VoHT and VoLT are provided 
to give the designer further information about these specific signals. 

ViH should not be held above Vcc + 0.5 volts. 

Guaranteed by design. 

Vouc applies to RUN and Exception. 
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IDT79R3000A/AE RISC CPU PROCESSOR COMMERCIAL TEMPERATURE RANGE 


AC ELECTRICAL CHARACTERISTICS(1:2:3) 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V +5%) 


| : 79R3000A 79R3000AE 
| 25.0MHz_ | 33.33MHz_| 
Symbol Parameter 


16.67MHz 
Clock 

















Test Conditions 








Input Clock HIGH) — | — [so] — | 6o| — | 
Input Clock LOW | — jsol| — |s6o| — | 
Top | Input Clock Period() 15 
Clk2xSys to cik2xSmps 0 0 
Clk2xSmp to Clk2xRd 6) 0 0 
Clk2xSmp to Clk2xPhi(®) yc/4.| 5.0 3.5 








Run Operation 

ToEn | Data Enable he cee, ete 

Topis_ | Data Disable(9) Bete 
(1: 7 


wo 
© 
| 
~ 
=) 
| 
—k 
or 


N cn} oo | | 
=) COlClo | 


Twa 
5s _|Data Setup S| 
ov [DataHo | 

[cpBusy Setup S| 

a 


> 
w” 


= 
on 


wo i) mimo] dg 


ie) 






| 
ne) 
on 


| 
a mp fi 
on wn | O 


oBH 
acty [Access Type (1:0) | Load=25pF 
Tare 
Twwe [Memory Write | Load= 25pF | 
Téxe 
Tavel [Address Vaid __———‘[ Load=25pF __| 
Tints | Int(n) Set-up Pe ee el 
Soin 


TintH =| Int(n) Hold 
Stall Operation 


ius) 


4)a/ajala 
(e) 
ive) 
op) 





Ss 
~ 


et wlrmid [i 
Nw COlOlala 


—_ 
© 


NI] p 
oOI™N 





N 
oO 
— 
oO 





I | 
|| 
oi (Oa 
1 
NS IO] 
(63) ol Oo 





| | 

ae wre 

on on 
NIN] ro f= wo] 
O1o/C];F Clo 


ah 
on 
| 
N 
on 


| 
ed 
on 










[Tsavel[Adcress Vaid _——=édi toad aor «| — | 0 [| 2 [—| © [—]| 6] ms | 
[Tencry| Access Type =i Load eprF «| — ‘| 27 | — | 23 | —| te | — [135] ns | 
Tura [Memory Read initiate __—‘[ toad=2epF__—«| 4.0. | a7 [ao] 23 [io] ve | to [is] ns 
[Tei [RunTerminate =i toad= opr 3.0 | 7 [0 1 [so] to [20[ 75] ns _| 
[Tron [Runiniiate ————SS—«d Load=2epF_—Ss[ — | 79 [— [eo] —|[ 40 [—]|so]| re | 
load=25pF «| 30 | a7 [30] 23 | a0] we | 20] 95 | ns 
[Tsexe [Exception Valid —~=~=«édi Load=2epF_—=s«dP — ~+| ts | — | 13 | —]| 10 | —| 75] ns | 
Reset Initialization 

|Tast_[ResetPulsewidth sss | CO | — | oO | — [ool — | bo] — | Toye | 
| Trstput| Reset timing, Phase-lockon(#°) | 3000 | _— | 3000] — | 3000] — | 3000] — | Toyo _ 


Capacitive Load Deration 


|CLD_ |loadDerate') tl | Of OS | 10 | oS] 10] oO | 1.0 [ns/25pF| 
NOTES: 2860 tbl 13 


. All timings are referenced to 1.5V. 

. The clock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Cik2xPhi. 

. This parameter is guaranteed by design. 

. These parameters apply when the IDT79R3010 Floating Point Coprocessor is connected to the CPU. With phase lock on, Reset must be asserted 
for the longer of 3000 clock cycles or 200 microseconds. 

. Tcyc is one CPU clock cycle (two cycles of a 2x clock). 

. With the exception of the Run signal, no two signals on a given device will derate for a given load by a difference greater than 15%. 

. Clock transition time < 2.5ns for 33.33MHZz; clock transition time < 5ns for other speeds. 


&OND — 


NOM 
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COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Voc = +5.0V.45%) 


Symbol} 


Hews Input Clock High?) 
‘| TcokLow| Input Clock Lowi?) 


| Trstep | Reset timing, Phase-lock off(4.5) 
[ctb [toad Derate®) ee 


‘IDT79R3000A/AE RISC CPU PROCESSOR COMMERCIAL TEMPERATURE RANGE 





AC ELECTRICAL CHARACTERISTICS("1:2:3) 













79R3000AE 












Test Conditions 





Parameter 





Clock 














a} 
i) 

















Input Clock Period(2) ns. 
Cik2xSys to cikaxsmp™) ns 
Clk2xSmp to Clk2xRd 6) ns 
| Clk2xSmp to Clk2xPhi(®) ns 





Zz 
212 
© |o 
NIN 


Ly} fe 
on 
p= | 
n” 


Run Operation | 


Torn | Data Enable) 





| 
on 
a 


=] 
” 


ppis_| Data Disable Oe 
Load= 25pF fae gem eae am 


wrDly | Write Delay 


4 


DH Data Hold 
cas |CpBusy Setup i 
i 


4 


ap 


os_|DataSetup 


TcosH | CpBusy Hold 


Load= 25pF | 


Access Type (1:0 


| 


Tac i AT 
Tave_| Access Type 2 Loads 250F 
Tur cad 259F 


c 


Exc oad=25pF | 8 
TAval__| Address Valid Oad= 25pF . ane ale 


Tints | Int(n) Set-up 


TintH | Int(n) Hold 


Stall Operation — 








| 

NO 

on 
S15 fo > 3 
A 1M [HM 1H D | 


| 
nm |: 
o1 


















load=25pF | CT CY 
TsMWwr 
TSExe | Exception Valid 


Reset Initialization 


Trst_| Reset Pulse Width = Bee Ys 


TrstPLL | Reset timing, Phase-lock on(45) 


oad= 25pF 


|Tsaval |Address Valid | Load=25pF | ns 
Access Type [loads 25pF | ts 
| Turci_|Memory Read Initiate | Load= 25pF a ee eee eae Oe 
, R ade 25pF | COT ts 
si__|RunTerminate | Load 25pF_— | | ts 
Run |_ns.| 
| 


MRdt | Memory Read Terminate 


_ {| _ _ | 
Oo 


3 


oad=25pF 





<= . 
oO 1O |O FT 


ie) 
oO 
jo) 
oO 





Capacitive Load Deration 





a 
nN 
on 
row 
nl 


. 2860 tbl 15 
NOTES: 

1. All timings are referenced to 1.5V. . bs 

2. The clock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This parameter is guaranteed by design. ; . . ae 

4. These parameters apply when the IDT79R3010 Floating Point Coprocessor is connected to the CPU. With phase lock on, Reset must be asserted 

for the longer of 3000 clock cycles or 200 microseconds. | 

Tcyc is one CPU clock cycle (two. cycles of a 2x clock). . 

With the exception of the Run signal, no two signals on a given device will derate for a given load by a difference greater than 15%. 

Clock transition time < 2.5ns. 


NOM 
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INPUT CLOCK TIMING 
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PROCESSOR REFERENCE CLOCK TIMING 


Tcyc 


Tsys | . 2860 drw 18 


SysOut 





SmpOut " 


RdOut’” 


PhiOut "” 


NOTE: 
1. These signals are not actually output from the processor. They are 
drawn to provide a reference for other timing diagrams. 
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SYNCHRONOUS MEMORY (CACHE) TIMING 
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MEMORY WRITE TIMING 
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MEMORY READ TIMING 
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COPROCESSOR LOAD/STORE TIMING 
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INTERRUPT TIMING 
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Tint. § : 3 Tint S 2860 drw 23 
MODE VECTOR INITIALIZATION 
Phase ~ | 1 | 2 | 1 | 2 | 1 | 2 | 1 | 2 | 1 | 2 | 1 
SysOut - hh as ae Tevs |/ Tavs 
an KOK KOKO 
sal er rl Tint H Tint H wall TintH | = 
Tint S Tint S Tint S nt S 
Reset 
haa 
2860 drw 24 
NOTES: 


1. Reset must be negated synchronously; however, it should be asserted asynchronously. Designs must not rely on the proper functioning of SysOut prior 


to the assertion of Reset. 

2. If Phase-Lock On or R3000 Mode are asserted as mode select options, they should be asserted throughout the Reset period, to insure that the slowest 
coprocessor in the system has sufficient time to lock to the CPU clocks. 

3. Reset is actually sampled in both Phase 1 and Phase 2. To insure proper initialization, it must be negated relative to the end of Phase 1. 
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ORDERING INFORMATION 
IDT = _XXXXX XK XK 
Device Type Speed Package Process/ 
Temperature 
Range 
| Blank 
G175 
G144 
16 
20 
25 
33 
40 
79R3000A 
79R3000AE 





VALID COMBINATIONS 
IDT 79R3000A - 16, 20 All Packages 
79R3000AE - 25, 33, 40 All Packages 
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Commercial (0°C to +70°C) 


175-Pin PGA (Cavity Down) 
144-Pin PGA (Cavity Down) 
172-Pin Flat Pack (Cavity Down) 


16.67 MHz 
20.0 MHz 
25.0 MHz 
33.33 MHz 
40.0 MHz 


RISC CPU Processor 
Enhanced Timing Version 
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FEATURES: 

¢ Enhanced Instruction. Set compatible version of 
IDT79R3000 RISC CPU 

¢ Achieves high-performance with reduced parts count 
and lower overall system cost - 7 

¢ Flexible on-chip cache controller supports various 

cache, main memory sizes 

Supports optional data parity with parity error output 

signal 

Works with IDT79R3010A RISC Floating-Point 

Coprocessor 

DMA interface support | 

Large synchronous memory space for real-time systems 

Full 32-bit operations — 32-bit registers, 32-bit address 

and data interface 

On-chip memory management unit with 64 fully-associa- 

tive TLB entries maps 4GB virtual address space 

High-speed interrupt response (6 interrupt input pins) 

with precise exception capability 

High-speed CMOS technology results in speeds from 

12.5 to 40MHz 


e 


e 


* Supports caches from 8kB to 16MB 


¢ Independent block refill sizes for the instruction and data 
caches | 

¢ Concurrent cache refill and execution 

¢. Works on 8-, 16- and 32-bit data 

¢ Supports unaligned 32-bit data 

¢ Optimizing compilers for C, Ada, Pascal, Fortran, others 

¢ RTOS support for C or Ada environments 


DESCRIPTION: ; 

The IDT79R3001 brings the high-performance inherent in 
the IDT79R3000 RISC Microprocessor to lower cost systems. 
It does this while maintaining full (both User and Kernel) 
software compatibility with both the IDT79R2000A and 
IDT79R3000 RISC Microprocessors. 

The IDT79R3001 achieves lower system cost by reducing 
the number of components required to construct a synchro- 
nous memory (or cache) external to the processor and by 
simplifying the asynchronous memory interface. By removing 
the requirement for parity and allowing the system designer to 
select the cache. organization which best suits the system, 
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overall parts count is dramatically reduced while maintaining 
high performance. 

The IDT79R3001 RISC Microprocessor extends the ability 
of the IDT79R3000 family to support embedded and cost 
sensitive applications. Its level of integration and flexibility 
allows high-performance systems to be constructed at rea- 
sonable cost in a straightforward manner, without forcing the 
system designer to support features not required in his appli- 
cation. 

The IDT79R3001 consists of two tightly coupled proces- 
sors integrated ona single chip. The first processor is a full 32- 
bit CPU based on RISC principles to achieve a new standard 
of performance in microprocessor based systems. The sec- 
ond processor is a system control co-processor, called CPO, 
containing a fully associative 64-entry TLB (Translation 
Lookaside Buffer), MMU (Memory Management Unit), and 
control registers, supporting a 4GB virtual memory subsystem 
and a Harvard Architecture Synchronous Memory/Cache 
controller which achieves ultra-high bandwidth using industry 
standard SRAM devices. 

This data sheet provides an overview of the features and 
architecture of the IDT79R3001 CPU. A more detailed de- 
scription of the operation and timing of this device is incorpo- 
rated in the /DT79R3001 Hardware User's Guide, and a de- 
tailed architectural overview is provided in the M/PS RISC 
Architecture book, both available from IDT. Further literature 
describing the hardware, software, and development tools for 
the IDT79R3001 is also available from IDT. 


HARDWARE OVERVIEW 

The IDT79R3001 is a high-performance RISC micropro- 
cessor incorporating a fast execution engine and sophisti- 
cated yet flexible memory interface designed to support the 
processor bandwidth requirements at minimal system cost. 


Execution Engine 

The IDT79R3001 contains the same basic execution en- 
gine as the ultra-high performance IDT79R3000 and thus 
achieves over 28 MIPS performance at 33MHz. 

The key to the performance of the processor is the instruc- 
tion pipeline, illustrated in Figure 2. The execution of a single 
IDT79R3001 instruction consists of five primary steps, some 
of which may be broken down further into smaller subsets. 

The five primary stages of the pipeline, each of which 
require approximately one CPU cycle, are: 


IF — Instruction Fetch, when the processor fetches the 
instruction from the Instruction Synchronous Memory. 


RD Readrequired operands from on-chip register file while 
decoding the instruction. 


ALU Perform the required operation on instruction 
operands. 


MEM Access data memory (load or store). 


WB Write results back to register file. 





: des WB 


SS 
One Cycle 





2873 drw 02 
Figure 2. IDT79R3001 Five-Stage Pipeline 


Thus, the CPU achieves an average execution rate ap- 
proaching one instruction per CPU cycle, since the execution 
of five instructions at a time are overlapped within the proces- 
sor (Figure 3). Optimizing compiler technology fully compre- 
hends the interaction of software with the various pipeline 
resources, and serves to both eliminate any potential pipeline 
conflicts which might arise and to maximize instruction 
throughput. 


The IDT79R3001 Memory Interfaces 

~The key to achieving the inherent performance of the 
IDT79R3001 is to design a memory subsystem capable of 
providing a new instruction to the processor on almost every 
clock cycle. 

Like the IDT79R3000, the IDT79R3001 supports a hierar- 
chical view of the memory subsystem. However, the 
IDT79R3001 allows the system designer to make more trade- 
offs in the partitioning and architecture of the various levels in 
order to more completely meet the needs of certain types of 
applications. 

The IDT79R3001 supports two classifications of external 
memory: synchronous and asynchronous. The Harvard- 
Architecture (Separate instruction and data memories) syn- 
chronous memory allows the processor to achieve the highest 
levels of performance. The processor is able to obtain both an 
instruction and data word from the synchronous memory on 
every clock cycle, resulting in high instruction and data 
throughput. 
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Figure 3. Instruction Execution in IDT79R3001 Pipeline 
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The asynchronous memory space contains larger, slower 
memory devices such as EPROM, main memory DRAMs, 
and peripheral devices. Multiple clock cycles are required for 
data movement in the asynchronous memory. 

Many systems implement a memory hierarchy between 
these two memory spaces, whereby the synchronous memory 
space is used as processor caches and the asynchronous 
memory space is used for main memory. The IDT79R3001 
integrates a flexible Direct-Mapped Cache Controller On- 
Chip, eliminating external cache control logic and minimizing 
cache management overhead. If the synchronous memory 
space is used for processor caches, then cache “misses” will 
cause the processor to automatically process an asynchro- 
nous memory transfer to refill the cache. 


The key to. achieving the system cost and performance © 


goals of an IDT79R3001-based system is to partition the 
memory system to the needs of the application. 
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Synchronous seme System ; 

As with any high-performance processor, the IDT79R3001 
requires high-bandwidth to achieve high-performance. Thus, 
it is important that the majority of its execution occur in the 
synchronous memory space. In applications which require 
substantial amounts of main memory, this memory space will 
be implemented as instruction and data caches. 

The synchronous memory is designed to be able to supply 
both an instruction and data word to the processor on each 
clock cycle. When the synchronous memory spaces are used 
as caches, then they are used to hold instruction and data that 
is repetitively accessed by the CPU (for example, within a 
program loop). This reduces the number of slower asynchro- 
nous memory cycles and thus achieves higher performance. 

Some microprocessors incorporate small amounts of cache 
on-chip, which has a very small and unpredictable effect on 
the execution of large programs. The IDT79R3001 supports 


1 
(Instruction 
Read) 


“2 
(Data Store) 


! CPU Data Pins | 
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Instr. RAM 


Figure 4. Synchronous Memory Control Timing 


caches of from 8kB in size up through 16MB, thus bringing 
substantial performance improvements to very large pro- 
grams and also allowing real-time system designers to design 
cache-based systems to support deterministic requirements. 

The IDT79R3001 directly controls the synchronous memory 
interface (whether it is being used as caches or not) with a 
minimum of external components. The IDT79R3001 includes 
all control signals and cache TAG control logic (for a direct 
mapped cache) forthe synchronous memory interfaces. Parity 
over the data portion of each synchronous memory can be 
optionally selected at RESET time for applications which 
desire to make this cost trade-off. 


The synchronous interface works by dividing the basic 
CPU cycles into two phases. During one phase, a cache 
address is presented by the processor and captured by 
external latches (the latch control ‘signals are directly gener- 
ated by the CPU). During the next phase, the address for the 
other memory space is generated and captured while the data 
movement operation or the first cache is completed. The 
processor directly generates the SRAM Output Enable and 
Write Enable signals and the address latch enable signals, 
requiring no external decoding. This is illustrated in Figure 4. 

Further, the IDT79R3001 supports the ability to refill mul- 
tiple words into the cache from main memory when a cache- 
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miss occurs, further reducing system cost and increasing 
performance in cache-based systems. The IDT79R3001 can 
obtain 1, 4, 8, 16, or 32 words from main memory when 
processing a cache-miss, thus amortizing the cache-miss 
penalty over a large amount of data. 

The IDT79R3001 also performs instruction streaming, which 
is the simultaneous execution of incoming instructions while 
the cache Is being refilled. 

The actual width of the tag bus, and whether or not parity 
over the data parts of each synchronous memory is included, 
is determined according to how the device is initialized. The 
IDT79R3001 can accommodate a TAG bus width of 0-19 bits, 
compatible with a variety of cache sizes and cacheable main 
memory choices. The IDT79R3001 allows the system de- 
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signer to scale the synchronous memory system exactly 
according to the system needs, thus eliminating extra memory 
and logic devices and achieving substantial cost savings with 
no loss of performance. 

Thus, the synchronous memory interface of the IDT79R3001 
allows for high-bandwidth memory systems to be imple- 
mented with a minimum of control logic. This is desirable, 
since RISC performance tends to be a function of memory 
bandwidth. By simplifying the design of the synchronous 
memory system (illustrated in Figure 5), it is easier for the 
system designer to achieve high performance with minimum 
chip count and without requiring ultra-fast or specialty com- 
ponents. 
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Figure 5. IDT79R3001 Synchronous Interface 


The TAG Bus 

The TAG bus of the IDT79R3001 has been designed to 
allow the system designer to implement the exact cache 
configuration that is right for the system. For larger caches, 
low-order TAG bits do not need to be supplied for the TAG 
comparison. Additionally, the number of high-order TAG bits 
supplied is determined by the system designer, according to 
the amount of cacheable main memory the system supports. 
Since most embedded systems would tend to implement 
caches of 16kB and greater, and cacheable memory spaces 
of 32MB or smaller, significant cost and area reductions are 
achieved by configuring a smaller TAG bus. 


The system configures the on-chip TAG comparator at 
RESET Initialization time. If a TAG bit is not to be included in 
the synchronous memory TAG bit compare, a pull-down 
resistor of 4kQ is connected to the appropriate IDT79R3001 
TAG pin. If a TAG bit is to be included, no resistor is required 
(the IDT79R3001 pulls floating inputs to Vcc during RESET 
by asmall pull-up, which is disabled when RESET is negated). 

lf a TAG bit is excluded from the cycle-by-cycle compari- 
son, it is still driven out with the appropriate address value 
during write cycles or asynchronous memory reads. Thus, the 
system designer still has the full 4GB of address space 
available for address decoding, without requiring the synchro- 
nous memory to be able to cache all such addresses. 
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- Figure 6 illustrates a reduced system, which implements 
16kB of Instruction and 16kB of data cache, and 512MB of 
cacheable address space, using just 6 IDT71586 4kx16 
Latched CacheRAM™ components and 4 pull-down resis- 
tors. 

Note that in systems which do not implement the synchro- 
nous memory space as cache, then pull-down resistors would 


IDT79R3001 
RISController 


be added to all TAG pins. The Valid Pin still needs to be 
supplied on each cycle, thus allowing various memory schemes 
to be implemented (such as static column DRAM). However, 
the IDT79R3001 can be initialized to not assert the Valid pin 
as an output during Write cycles, simplifying the design of logic 
to drive the eine! 
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Figure 6. Small Footprint Cache for IDT79R3001 


Cache Update 

When the on-chip TAG comparator indicates that the item 
read from the cache was not the desired item, a cache-miss 
is processed. A main memory (asynchronous) transfer is 
automatically processed. 

The IDT79R3001 desires to update the cache using a burst 


refill of multiple adjacent words from main memory. The. 


processor is “stalled” until the first word of the block is 
available. The processor is then released, and the block of 
words is brought into the cache at the rate of one word per 
CPU clock cycle. | 

Note that if the cache-miss was in the instruction each, the 
processor is capable of simultaneously executing the incom- 
ing instruction stream as the cache is updated, thus effectively 
making the cache update transparent to the system and 
increasing performance. 


Write Cycles 
The IDT79R3001 utilizes a write through cache. That is, 
data written by the processor is both written to the cache and 


main memory simultaneously. Thus, main memory always 
has a current copy of all data. 

_ Typically, latching devices are used between the cache 
subsystem and the slower mainmemory. These Write Buffers 
capture the data simultaneous with the cache update, allow- 
ing the processor to continue to the next cycle without actually 
waiting for the main memory transfer to complete. The 
IDT79R3001 generates parity over the data field on write 
cycles, which can be propagated into both the Synenroneus 
and asynchronous memory spaces. 

When the processor writes less than a 32- bit quantity (a (a 
“nartial’ word), the processor can perform a “read-modify- 
write” of the cache. That is, the processor will read the 32-bit 
word containing the partial address(es) to be updated from the 
cache. If a “hit” occurs, then the new data will be merged with 
the old and the new 32-bit value will be written both to the 
cache and to main memory. Ifacache “miss” occurs, then only 
the partial data is written to main memory and the cache is 
unchanged. Partial word capability is selected as a RESET 
option. . 
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THE ASYNCHRONOUS MEMORY INTERFACE 


The IDT79R3001 also supports an asynchronous memory 
interface, which supports the use of slower memory devices 
such as slow DRAM or EPROM and also supports the use of 
peripherals and other “non-cacheable” devices. 

In general, if a cache-miss (or parity error, if enabled) 
occurs, the processor will automatically use the asynchro- 
nous memory interface to retrieve the desired data, and will 
update the cache accordingly. 

Additionally, software can force the use of the asynchro- 
nous memory space through the use of the on-chip MMU. 
When the processor seeks either instructions or data within a 
certain address range (kseg1), the processor knows that this 
datais uncacheable andwill perform an asynchronous memory 
transfer. Additionally, within cacheable memory, TLB entries 
can be used to make certain pages as “uncacheable”. When 
an address of an “uncacheable” page is used, the processor 
will automatically use the asynchronous memory space. 

The asynchronous memory space uses the same data bus 
as the synchronous memory space. This facilitates the auto- 
matic updating of cache memory when the asynchronous 
memory is accessed due to cache-miss activity or memory 
writes. The asynchronous address bus is composed from the 
synchronous memory AddrLo bus, and the TAG bus. External 
logic devices (such as IDT74FCT374A registers) are used to 
capture AddrLo and TAG values for the asynchronous trans- 
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fer address. Note that systems which exclude individual TAG 
bits from comparison (to reduce cache width) still have all 
TAGs available as outputs. 

The data path between the processor and the asynchro- 
nous memory space is managed according to the needs of the 
application. Write Buffer FIFO devices, such as the 
IDT79R3020, are used to capture address and data during 
store cycles. These devices are used to capture the data in 
one-cycle, and allow the processor to continue to execute 
from the synchronous memory while the slower asynchro- 
nous memory actual retires the write. 

The read path is also constructed according to the needs of 
the system. If block refill is used, then the read path is highly 
dependent on the design of the main memory system. Pipe- 
line devices such as IDT74FCT540A, or simple latches such 
as IDT74FCT374, may be used. 

A simple asynchronous memory interface is shown in 
Figure 7. In this system, main memory is assumed to be fast 
enough to support the block refill requirements of the system, 
thus simplifying the read path. In fact, both the read and write 
data paths are actually managed through a single set of 
IDT29FCT52A bidirectional latching transceivers. 

During write cycles (which are typically captured by Write 
Buffers), the processor asserts MemWr to indicate that a write 
cycle is in progress. The memory system negates WrBusy to 
indicate that the processor is done with the write cycle. 























FCT823A 
4 





(4) (4) 
Address Data 
Registers Transceivers - 







Main Memory 
Address (32) 






Main Memory 
Data (32) 







2873 drw 07 


Figure 7. IDT79R3001 Asynchronous Interface 
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Figure 8. IDT79R3001 Interface to IDT79R3010 Floating Point Co-Processor 


During read cycles, the processor will assert MemRd to 


>. indicate that a main memory read is in progress. The memory 


system will hold RdBusy active until the desired data is 
available. The processor will activate the XEn signal to allow 
data to be passed from the main memory to the processor 
databus. If the cache is to be updated with the new data, then 
the processor will assert the appropriate cache write signal to 
allow the cache RAMs to capture the incoming databus. 
The AccTyp bus is used to indicate the size of the data 
transfer (8-, 16-, 24-, or 32-bits), and for main memory reads, 
whether or not the data is “cacheable”. This simplifies the main 


memory address decoding, since the AccTypindicates whether 


the main memory needs to perform a burst read of multiple 
words. 


Co-Processor Interface 

The IDT79R3001 implements a co-processor interface, 
which allows the use of the IDT79R3010 high-performance 
RISC Floating Point Accelerator without requiring the use of 
external interface components. 

The co-processor interface has been designed to make 
system co-processors appear to the programmer as if they 
were on-chip extensions of the core execution engine. Thus, 
the IDT79R3010 FPA works.as a.true co-processor, rather 
than as a peripheral which must be programmed. | 

In the IDT79R3001 co-processor model, the CPU is re- 
sponsible for controlling all data cycles. The co-processor 
keeps in synchronization with the CPU (including the pipeline 
stages), and uses a Phase-Locked Loop to keep synchro- 


nized with the processor bus traffic. The co-processor then 
“snoops’ the data bus, watching for co-processor instructions. 
It also knows when data cycles on the bus are intended for it 
(either as a target in co-processor load operations, or as a 
source for co-processor restore operations), and performs the 
data portion of the operation when appropriate. Thus, co- 
processors effectively load and store directly with memory, 
without requiring operands to go through the CPU first. This 
achieves the highest levels of performance (note that the co- 
processor interface also supports move, whereby data canbe 
moved directly between the CPU and any co-processor). 
Figure 8 illustrates the use of the IDT79R3010 in a 
IDT79R3001 system. The co-processor interface manages 
synchronization between the parts, and is used to communi- 
cate status from the co-processor to the CPU. CpBusy, or Co- 
processor Busy, stalls the CPU until the busy co-processor 
resource (requested by a co-processor instruction) is free, 


and CpCond, or Co-processor Condition, is used to report 


status on co-processor test instructions. CpSync, is used to 
help the co-processor stay “locked” to the CPU, so that the co- 
processor knows when data is on the bus to be sampled on 
load operations, or when to place data on the bus for store 
operations. 

Note that the co-processor sits on the same data bus as the 
CPU, but has no connection to the address bus. The CPU is 
responsible for performing all memory addressing, including 


the determination of “cache hit’, write-buffer full cycles, and 


any processing that might be required for cache misses. 
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INTERRUPTS 

The IDT79R3001 features 6 separate interrupt input pins. 
Interrupts are not vectored, but rather cause the general 
exception vector address to be the next execution address. 

These pins are not encoded internally; external logic can 
choose to implement these interrupt lines as either 6 or 64 
interrupt Sources; software would then perform the appropri- 
ate decoding to get to the specific interrupt handler. 

Interrupts are recognized in the ALU stage of the on-chip 
pipeline. Instructions less advanced in the pipeline are “flushed” 
and will be restarted when the return from exception occurs 
(an on-chip register contains the address of the instruction 
which was excepted). Instructions further advanced in the 
pipeline are allowed to continue. Unlike other RISC proces- 
sors, the IDT79R3001 does not require the programmer to 
save and restore pipeline status to allow normal execution to 
be resumed. Depending on the application and exception, at 
most software would need to save/restore the on-chip data 
registers, status register, Exception PC and exception “cause” 
register. 

Note that the co-processor model includes “precise excep- 
tions.” That is, an exception is signaled to the exact instruction 
which generated the exceptional condition. No further state 
commitments are made by the IDT79R3001 and, thus, the 
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exact context at the time of the exception is known to the 
programmer. This is true even for multi-cycle operations, such 
as those of the FPA. 


DMA INTERFACE 

The IDT79R3001 features a simple DMA interface which 
allows an external master to gain control of the synchronous 
memory space. Note that itis notnecessary to include logic on 
the CPU to arbitrate for the asynchronous memory space; the 
read/write buffer interface is where such arbitration logic 
belongs and it is left to the system designer to implement the 
type of asynchronous memory structure that best fits the 
application. 

When an external master “owns” the synchronous bus, the 
CPU will tri-state the following pins and buses: 
¢ AddrLo: The synchronous memory direct address bus. 
- Data & Tag: The synchronous memory RAM data lines. 

- Cache Control: IRd, [Wr, IClk, DRd, DWr and DClk. This 
allows the external master to use the existing control 
lines to control the synchronous memory. 

¢XEn: The read buffer transceiver enable, which will 
allow the external master to use the read/write 
buffer path for DMA. 

¢ Valid: This enables the DMA interface to be used for 
multi-processing applications. 
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Figure 9. IDT79R3001 DMA Interface 
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MODE SELECTABLE FEATURES FOR THE IDT79R3001 
[input [| Weyle | XC CVO 





NOTE: 


1. Reserved signals must be "HIGH" during these cycles. 


The DMA interface consists of a single input signal, 
DMAStall, which causes the processor to stall and to tri-state 
the above named lines. The external master is guaranteed 
mastership of the bus within a very short number of cycles, 
depending on the exact external bus activity of the CPU when 
the DMA was requested. The DMA master negates the 
DMAStall signal when the DMA operation is completed to 
allow the CPU toresume processing. Consult the /DT79R3001 
Hardware User's Guide for more details. 

Figure 9 illustrates the system connection of an external 
DMA master to a IDT79R3001 system. 


ADVANCED FEATURES 


| The IDT79R3001 contains special features which provide 
~ added flexibility across a number of applications, as well as 
allow for system diagnostic support. | 
Insupport of diagnostics, the IDT79R3001 allows for cache 
“swapping” (interchange of which memory bank is for instruc- 
tion and which is for data), which is useful in system initializa- 
tion, cache flushing, and diagnostics. Additionally, the caches 
can be “isolated” from main memory, which forces cache “hits” 
to occur regardless of the tag comparison, and which is useful 
in determining that the synchronous memory space RAMs are 
functional. 


Multiply/Divide Registers 


General Purpose Registers 


G 
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Figure 10. IDT79R3001 Registers 
















. | 2873 thi 01 
An additional feature is the ability to enable parity checking 
over the data field of each synchronous memory. If parity is 
enabled, the processor will check the parity when a synchro- 
nous access occurs; if a parity error is detected, itis signaled 
to the external world on the Parity Error signal and a cache- 
miss cycle is processed. the Parity Error signal will remain low 
until the parity error flag in the CPO status register is cleared 
by software. 

_A number of other system selectable features are selected 
at reset time. The input reset “vectors” are sampled on the 
interrupt input lines during the last four cycles of the reset 
period. The input vectors are listed in Table 1. These selec- 
tions include the ability to select the block refill sizes for each 
of the instruction and data memories, whether Big Endian or 
Little Endian order is to be used, whether to use data parity, 
and whether or not to accommodate a Phase-Locked Loop for 
a co-processor. The initialization of the CPU and meaning of 
each input vector is more fully explained in the /DT79R3001 
Hardware User's Guide. 


PROCESSOR ARCHITECTURE 

The IDT79R3001 is a full implementation of the 
IDT79R2000A/IDT79R3000 Instruction Set Architecture (the 
MIPS-I ISA). This architecture is discussed in great detail in 
MIPS RISC Architecture, available from IDT. 


IDT79R3001 CPU Registers 

The IDT79R3001 CPU provides 32 general purpose (or- 
thogonal) 32-bit registers, a 32-bit Program Counter and two 
32-bit registers used to hold the results of the CPU integer 
multiply and divide operations. 

Two of the 32 general registers have special purposes 
designed to increase processor performance: register r0 is 
hardwired to the value “0”, a useful constant; and register r31 
is used as the link register in jump-and-link instructions (the 
return address for subroutine calls). Otherwise, there is no 
requirement that a particular register be used as a stack or 
frame pointer, etc., although there is a register convention as 
part of the “mips ABI” (Applications Binary Interface standard) 
which the compiler suite uses. 

The CPU registers are illustrated in Figure 10. Note that 
there is no Program Status Word register shown in this figure. 
The functions traditionally provided by a PSW register are 
instead provided in the Status and Cause Registers incorpo- 
rated within the on-chip System Control Co-Processor (CPO). 
The instruction set does not use condition codes. 
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instruction Set Overview 

All 1DT79R3001 instructions are 32 bits long and there are 
only three instruction formats (see Figure 11). This approach 
simplifies decoding, thus minimizing instruction execution 
time. The IDT79R3001 processor initiates a new instruction 
on every RUN cycle, and is able to complete an instruction on 
almost every clock cycle. The only exceptions are the LOAD 
instructions and BRANCH instructions, which each have a 
single cycle of latency associated with their execution (that is, 
the instruction immediately after the branch is always ex- 
ecuted regardless of the branch condition; similarly, the data 
loaded by aLOAD instruction is not available to the subsequent 
instruction). However, in the majority of cases the compilers 
(and even the MIPS assembler) are able to reorder instruc- 
tions to fill these latency cycles with useful instructions which 
do not require the results of the previous instruction (in the 
worst case, a NOP instruction is inserted). This effectively 
eliminates these latency effects and does not require the 
applications programmer to be aware of the pipeline structure. 

The actual instruction set of the CPU was determined after 
extensive simulations to determine which instructions should 
be implemented in hardware and which operations are best 
synthesized in software from other basic operations. This 
methodology has resulted in the highest performance proces- 
sor available. 

The IDT79R3001 instruction set can be divided into the 
following groups: 
¢ Load/Store Instructions move data between memory and 
the general registers. These are all “I-Type” instructions. 
The only addressing mode supported is base register plus 
signed, immediate 16-bit offset. This effectively allows three 
addressing modes: register plus offset, register (using zero 
offset), and immediate (using r0,the zero register). 

The Load instruction has a single cycle of latency, as 
described above. That is, the instruction immediately after 
the load instruction cannot rely on the new data; however, 
the assembler and compilers automatically handle this, 
reordering code to insure that no conflicts occur. Note that 
the store operation has no latency in its effect. 

Loads and stores can be performed on byte, half-word, 
word, or unaligned word data (32-bit data not aligned on a 
modulo-4 address). 

Computational Instructions perform arithmetic, logical, 
and shift operations on values in registers. They occur in 
both “R-Type” (both operands and the result are general 
registers), and “l-Type” (one operand is a 16-bit immediate 
value) formats. 

Note that computational instructions are three operand 
instructions: that is, the result register can be different from 
both source registers. This means that operands need not 
be overwritten by arithmetic operations. This results ina 
more efficient use of the register set, and further increases 
performance. 


oe (Immediate) 
26 25 21 20 16 


se (Jump) 
es 25 


bi (Register) 
se Ss iy ee aa 6 cae 
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Figure 11. IDT79R3001 Instruction Formats 


¢ Jump and Branch Instructions change the flow of control 
of a program. Jumps are always to a paged absolute 
address formed by combining a 26-bit target with four bits of 
the Program Counter (“J- Type” format for subroutine calls), 
or 32-bit register byte addresses (“R-Type,” for Returns and 
dispatches). Branches have 16-bit offsets relative to the 
program counter (“I-Type’”). 

Jump and Link instructions save a return address in 
Register 31. The IDT79R3001 instruction set features 
numerous branch conditions. Included is the ability tobranch 
based on a comparison of two registers, or on the compari- 
son of a register to zero. Thus, net performance is in 
creased since the processor does not have to precede the 
branch instruction with arithmetic operations. 
Co-processor Instructions perform operations in the co- 
processors (such as the IDT79R3010 FPA). Co-processor 
Loads and Stores are “I- Type;” computational instructions 
have co-processor dependent formats. 

Co-processor 0 Instructions perform operations on the 
System Control Co-processor (CPO) registers to manipu- 
late the memory management and exception handling 
facilities of the on-chip co-processor. 

Special Instructions perform a variety of tasks, including 
movement of data between general and special registers, 
system calls, and breakpoint operations. These are always 
“R-Type.” 


IDT79R3001 System Control Co-processor (CPO) 
The IDT79R3001 can operate with up to four tightly coupled 
- co-processors, designated CPO-CP3. CPO is included on- 
chip as co-processor 0, the System Control co-processor. 
CPO is responsible for supporting both the virtual memory 
system and the exception handling functions of the 
IDT79R3001. 
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IDT79R3001 INSTRUCTION SUMMARY 


Description 


_Loead/Store Instructions 
Load Byte 
Load Byte Unsigned 
Load Halfword 
Load Halfword Unsigned © 
Load Word 
Load Word Left | 
Load Word Right 
Store Byte 

_ Store Halfword: 

_ Store Word 
Store Word Left. . 
Store Word Right 


Arithmetic Instructions 
(ALU Immediate) 
Add Immediate 
Add Immediate Unsigned 

~ Set on Less Than Immediate 
Set on Less Than Immediate _—- 

Unsigned BGEZ 
AND Immediate ; | | 
OR Immediate. : ae q BLTZAL 
Exclusive OR Immediate . | ME BGEZAL 
Load Upper Immediate | | . % 


‘Arithmetic Instructions _ SYSCALL 


(3-operand, register-type) | BREAK _. 


Add — | 
Add Unsigned oe 
Subtract | hs ~ LWCz 
Subtract Unsigned SWCz 
Seton Less Than | MTCz 
| | tte MFCz 
Set on Less Than Unsigned CTCz. 
AND | re | CFCz. 
OR ae a COPZ . 
| ExelusiveOR = BCzT 
NOR © . | BCzF 


Shift Instructions 

Shift Left Logical 

Shift Right Logical 

Shift Right Arithmetic 

Shift Left Logical Variable 
Shift Right Logical Variable 
Shift Right Arithmetic Variable 


5.2 


- Description 


| Multiply/Divide Instructions 


Multiply 
Multiply Unsigned 
Divide 


Divide Unsigned 


Move From HI 
Move To HI 
Move From LO 
Move To LO 


Jump and Branch Instructions 
Jump 


Jump and Link 


Jump to Register 


_ Jump and Link Register 


Branch on Equal 
Branch on Not Equal 
Branch on Less than.or Equal to Zero 


Branch on Greater Than Zero 


Branch on Less Than Zero | 


| Branch on Greater than or 


Equal to Zero 

Branch on Less Than Zero and Link 
Branch on Greater than or Equal to 
Zero.and Link: ig. 
Special Instructions 


_ System Call 


Break 


Coprocessor Instructions 
Load Word from Coprocessor 


_ Store Word to Coprocessor 


Move To Coprocessor » 

Move From Coprocessor 

Move Control to Coprocessor 
Move ‘Control From Coprocessor 
Coprocessor Operation. | 
Branch on Coprocessor z True. 


__ Branch on Coprocessor z False — 


System Control Coprocessor 
(CPO) Instructions | 

Move To CPO 

Move From CPO © — 


- Read indexed TLB entry 


Write Indexed TLB entry 
Write Random TLB entry . 
Probe TLB for matching entry 
Restore From Exception 
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CPO Registers 
As aco-processor, CPO has a number of registers which it 
uses to perform its control functions. These include 64 fully 


associative Translation Lookaside Buffers (TLBs), used to 

manage the virtual memory space; registers to manage the 
TLB set; and the exception handling registers. Figure 12 

illustrates the register set of the System Control Co-processor. 
Table 3 provides a brief explanation of the function of eachof 68 

these registers. A more detailed explanation of the use of each 

of these registers is included in the M/PS RISC Architecture i 

addressing needs of other tasks. This is also useful insystems 0 BY RANDOM 

with limited physical memory; the IDT79R3001 provides for 

the logical expansion of memory by translating addresses [7] Used with Virtual Memory System 
composed in a large virtual space into available physical 

memory addresses. 















RANDOM 


manual. 





Memory Management System 
The IDT79R3001 supports a virtual memory system, so 
that each task in a given application can be unaware of the 


“NN © 


Used with Exception Processing 
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Figure 12. The System Control Co-processor (CPO) Registers 


CPO REGISTERS 


















Status Mode, interrupt enables and diagnostic status 
information 
Indicates nature of last exception 
EPC Exception Program Counter—contains address of 
instruction which detected the exception 
Pointer into kernel’s virtual Page Table Entry array 


BadVA Most recent bad virtual address 
Processor revision identification (Read only) 
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MMU ADDRESS TRANSLATION 
VIRTUAL 


OxELLELL OL 
Kernel Mapped 
Cacheable 
(kseg2) 
Oxc0000000 


Kernel Uncached 

kseg1 
Oxa0000000 ( g ) 
Kernel Cached 


ksegO 
Ox80000000 ( g ) 


User Mapped 
Cacheable 
(kuseg) 


Ox00000000 


IDT79R3001 Operating Modes 

The IDT79R3001 has two operating modes: User Mode 
and Kernel Mode. The IDT79R3001 normally operates in the 
User Mode until an exception is detected, forcing it into the 
Kernel Mode. The processor remains in Kernel Mode until the 
exceptions are handled and the processor executes an RFE 
(Return from Exception) instruction, which will restore it to 
User Mode. Kernel Mode allows software to alter machine 
state information such as that contained in the CPO registers; 
that is, if in User Mode an access is attempted to Co-processor 
0 and the Kernel has not enabled the User to access the co- 
processor, an exception will occur. Similarly, if a User task 
attempts to use a Kernel virtual address, an exception will 
occur. Thus, system resources are protected from User tasks. 

The manner in which memory addresses are translated 
(mapped) depends on the operating mode of the IDT79R3001 
and on the virtual address desired. Figure 13 illustrates the 
virtual address mapping performed by the IDT79R3001: 

User Mode — in this mode, a single, uniform virtual ad- 
dress space (kuseg) of 2GB is available to each user task 
(tasks are further identified by a 6-bit process identifier field in 
order to form unique virtual addresses). All references to this 





PHYSICAL 


Physical | 3584 MB 


Memory 


512 MB 
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segment are mapped using the TLB, which utilizes both the 

virtual address and the Process ID field to perform the virtual- 

to-physical mapping (note that this allows the cache to be 
shared by up to 64 User processes at a time without requiring 
time consuming Cache or TLB flushing). 

Kernel Mode—Four separate segments are accessible 
through this mode: 

* kuseg—When in the Kernel Mode, references to this seg- 
ment are treated just like User Mode references, thus © 
streamlining Kernel accesses to User memory. 

¢ kseg0—References to this 512MB segment may use the 
cache memory, but are not translated by the TLB. Instead, 
these addresses map directly to the first 512MB of the 
physical address space. Note that many dedicated embed- 
ded applications will utilize this address space and kseg1 only, 
rather than any of the TLB mapped segments. 

¢ kseg1—References to this 512MB segmentare not mapped 
through the TLB. Additionally, this memory is viewed as 
uncacheable, which means that references through this 
segment will always use the asynchronous memory inter- 
face. As with ksegO, references through this segment are 
hard-mapped to the first 512MB of physical memory. When 
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ce 





ENTRYHI 


VPN — Virtual Page Number 
TLBPID — Process ID 

PFN — Physical Frame Number 

N — Non-cacheable Physical Page 


ENTRYLO 


D — Dirty Page/Write Protect 
V — Valid entry flag 
G — Global flag (ignore PID) 
O—Reserved — 
2873 drw 14 


Figure 14. TLB Entry Format 
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the processor boots, the reset vector is contained in this 
segment, so that the processor does not require either the 
cache or the TLB to be valid at RESET time. 
kseg2—References to this 1GB segment are always 
mapped through the TLB. As with kuseg, the ability of 
memory pages to be cached is determined by a bit setting 
in the TLB entry for that page. 


The Translation Lookaside Buffer (TLB) 

The translation of virtual addresses in either kuseg or kseg2 
(mapped segments) is performed by the on-chip Translation 
Lookaside Buffer array. This array consists of 64 fully-asso- 
ciative (content addressable) memory elements. Each entry 
maps a 4kB virtual page to a 4kB physical page. Each TLB 
entry contains other information about the virtual address: it 
maps (such aswhich User process it maps) and also about the 
physical address (such as whether it is cacheable or writeable). 

Figure 14 illustrates the format of each TLB entry. The 
translation operation is illustrated in Figure 15. The upper 
portion of the desired virtual address is compared against the 
VPN field of each TLB entry. Additionally, the current process 
ID (contained in the TLBHI register) is matched against the 
PID field of the TLB entry (if the TLB entry is marked as Global, 
the PID comparison is ignored). lfamatch occurs, and the TLB 
entry is marked as Valid, then the translation is completed by 
replacing the VPN of the virtual address with the correspond- 
ing PFN (Physical Frame Number). 


Current 
Process ID 


w 
— 





Note that the use of the TLB does not incur an execution 
penalty, since the execution engine pipeline includes stages 
to cover for the time required to make the TLB search and 
translation. 

TLB misses occur when no successful match occurs. 
These events are handled in software. The CPO registers give 
the software enough information to obtain the appropriate TLB 
entry at speeds which exceed those achieved by many CPUs 
which use hardware TLB replacement (10-12 cycles under 
UNIX). 

When a TLB miss occurs, the address of the instruction 
which was executing is stored in the EPC register, and the 
BadVA register contains the address which was being trans- 
lated. The Context register uses the BadVA value to generate 
a direct pointer to the kernel Page Table Entry for the desired 
virtual address. The Random register suggests the TLB entry 
to be replaced by the new entry. Note that the lower eight TLB 
entries are not pointed to by Random; the kernel software can 
thus insure that it is constantly mapped, and deterministic 
response is guaranteed. 


BACKWARD COMPATIBILITY WITH 
IDT79R2000A AND 79R3000 PROCESSORS 

The IDT79R3001 can execute the same binary software 
(either kernel or user) that is executed by either the 
IDT79R2000A or IDT79R3000. At the system level, some 
hardware re-design is necessary to achieve the cost savings 
inherent in the IDT79R3001 hardware interface. 


12 11 0 
Virtual 
Address 
Physical 
Address 
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Figure 15. Virtual to Physical TLB Translation 
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PIN DESCRIPTIONS 


Memory Interface . 


Data (0:31) — VO | A 32-bit bus used for all instruction and data transmis 
asynchronous memory space and co-processors. . 


DataP (0:3). VO. | A 4-bit bus containing even parity over the data bus. If parity checking is enabled, a parity error will cause the Perr 


| signal to be asserted and a cache-miss to occur. Regardless of whether parity checking is enabled, the processor 
Tag (13:31) 


.will always generate parity on writes. 
AddrLo (0:23) is | 


A 19-bit bus used for transferring cache tags and high-order addre 
Synchronous Memory Control 


asynchronous memory spaces. . 

A 24-bit bus containing low-order byte addresses for both the synchronous ( 
‘IRd = sf The output enable for the instruction cache. The polarity of this signal is selectable. 
Wr 





Description 



















sion among the processor, synchronous memory space, 




















ss bits between the processor, caches and 





cache) and asynchrorious memory | 
spaces. | | | 


The write enable for the instruction cache. The polarity of this signal is selectable. 7 


ICk | O | The instruction cache address latch clock. The clock runs continuously. | | 
‘DRG =i asd] The output enable for the data cache. The polarity of this signal is selectable. 
‘Dwr (sid CO The write enable for the data cache. The polarity of this signal is selectable 






DClk | The data cache address latch clock. The clock runs continuously. 


Valid VO | Ahigh on this signal indicates that the Tags just read from the cache are valid. When a cache update occurs, the 
processor will generate the appropriate Valid bit. a an 


"If parity checking is enabled, this signal is an active low output of the internal CPO parity error Status bit. It is driven 
| low when a parity error is detected and remains low until software clears the parity errorflag in the status register. This 
Asynchronous Memory Interface | | 


pin is physically the same pin as AccTyp2. Its function is selected during device reset. 
/XEn ——«|_(O | The transceiver enable for the read buffer. | See | 
AccTyp (0:2) A3-bit bus used to indicate the size of data being transferred on the asynchronous memory bus, whether or not a data 
transfer is occurring and the purpose of the transfer. If parity checking is enabled, AccTyp2 becomes the PErr signal. 


MemWr___|_© | Signals the occurrence of an asynchronous memory writecycle, 
Memad | _© | Signals the occurrence of an asynchronous memory readeyole, 


. | O | ‘Indicates the.instruction about to commit processor state should be aborted and other exception related information. 
| SysOut = ||: A clock derived from the internal processor clock used to generate the system clock. 



































The asynchronous memory read stall termination signal. In most system designs, RdBusy is normally asserted and 
is deasserted only to indicate the successful completion of the memory read. RdBusy is sampled by the processor 
only during memory read stalls. : 


|WrBusy | I | The asynchronous memory write stall initiation/terminatio 
Co-Processor Interface 7 = i 


| CpSync_ ss |_: O _| Aclock which is identical to SysOut and used by co-pro 
CPBusy | | | The co-processor busy stall initiation/termination signal. 


CpCond (0:3) A 4-bit bus used to transfer conditional branch status from the co-processors to the CPU. CpCond(0) is used to control 
whether or not a cache burst refill occurs; the other signals are used as input port pins for co-processor branch 
instructions. | _ 


Processor Control Signals 


DMAStall i DMA Stall. Signals to the processor that it should stall accesses 


synchronous memory interface. | : 

Int (0:5) A 6-bit bus used to signal maskable interrupts to the CPU. A reset time, mode values are sampled from this bus to 
initialize the processor. During normal operation, these signals are not latched by the processor.and must remain 
asserted until the processor acknowledges the interrupt (through software) to the interrupt source. 

Clk2xSys Pale | The master double frequency input clock, used to generate SysOut. 


Clk2xSmp/Rd ea A double frequency clock input used to determine the sample point for data coming into the CPU and co-processors | 












n signal. WrBusy is only sampled during write operation. 











cessors for timing synchronization with the CPU. 








to the synchronous memories and tri-state the | 










and used to determine the enable time of the synchronous memory RAMs. 


Clk2xP hi ih A double frequency clock input used to determine the position of the two internal phases. 





Initialization input used to force execution starting from the reset memory address. Reset should be asserted 
asynchronously but must be negated synchronously with the leading edge of SysOut. 
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ABSOLUTE MAXIMUM RATINGS“ 3) RECOMMENDED OPERATING 
Symbol] Rating | Gommercial_| Unt] TEMPERATURE AND SUPPLY VOLTAGE 
vienM [Terminal Votage [~-05t047.0 | V [Temperature [GND [Veo | 
with Respect Commercial 0°C to +70°C OV 5.0 +5% 
to GND 16-33 MHz (Ambient) 
TA, TC Operating 0 to +70) °C : : 
Temperature (Ambient) Commercial 0°C to +90°C 5.0+5% 
(Case) 2873 tbl 07 





TBIAS Case Temperature —55 to +1254) °C 
Under Bias 0 to +9065) 
TSTG Storage —55 to +125 °C 
Temperature 


FIN Input Voltage —0.5 to +7.0 


NOTE: : 2873 tbl 05 

1. Stresses greater than those listed under ABSOLUTE MAXIMUM RAT- 
INGS may cause permanent damage to the device. This is a stress rating 
only and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 
implied. Exposure to absolute maximum rating conditions for extended VREF . 
periods may affect reliability. To Device 

2. ViNminimum = —3.0V for pulse width less than 15ns. +1.5V Under Test 
Vin should not exceed Vcc +0.5 Volts. 

3. Notmore than one outputshould be shorted ata time. Duration of the short 25pF 
should not exceed 30 seconds. 

4. 16-25 MHz only. = 

5. 40 MHz only. 





OUTPUT LOADING FOR AC TESTING 





—4mA 
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AC TEST CONDITIONS 
[Symbol | ___ Parameter 
[Vi | Input HIGH Vonage _ 














iRd, IWr, DRd, DWr 
All Others 


Cin [ Max [Unit] 
Se ae 
ese 2 
[oa [iv _ 
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DC ELECTRICAL CHARACTERISTICS: a be Soe 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Voo=+5.0V+5%) 





















~ Test Conditions 
Output HIGH Voltage | Voc = Min., loH = -4mA 
VoL 


| Vo. | Output LOW Voltage | Voc =Min., lol =4mA 
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Input LOW Voltage _ 


Parameter 










co) 
uo 
ad 
o 


=) 
SN 


od 
oO 
4 m wo el ed 
s oO oy oO} oj; ol 


o Oo ‘ 
B |! Jo 
nd 
is 
ae 
ome 


© 

[ee) 
-) S) 
ff oe 








= = 
> | > Ss 
= 
oy) 
ro) 


Ae w no 

i) ro) oO 

— wo Sa ba 
oO ro) of! jofa]! 





Tvs [pa tow vonage) [J 
Ce ee oe eae 
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Tix nputow tearage®)—[we-ano [00] — [oof — [ooo] — [100] — 
Output Tri-state Leakage 100 | -100 | 100 |-100 | 100 |-100 | 100 | pA 





NOTES: oy ae S 2873 tbl 08 
. Vit Min. = —3.0V for pulse width less than 15ns. VIL should not fall below —0.5V for larger periods. ane a 


1 

2. VIHs and ViLs apply to Clk2xSys; Clk2xSmp/Rd, Clk2xPhi, CpBusy,-and Reset. _ 
3. These parameters do not apply to the clock inputs... 
4 


. VoHTand VoLT apply to the bidirectional data and tag buses only. Note that Vinand Vit also apply to these signals. VOHT and VOLT are supplies as additional. 
information to help the system designer understand the relationship between current drive and output voltage on these pins.. ~ —— 


. Vit should not be held above Vcc + 0.5 volts. 

. The IDT79R3001 contains an internal pull-up/current source on the TAG pins to facilitate initialization. This current source is disconnected when Reset 
is inactive. . 

7. Guaranteed by design. 

8. VoHc applies to RUN and Exception. 


oo 
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DC ELECTRICAL CHARACTERISTICS 
COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc = +5.0V +5%) 
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Output Capacitance(®) 
lice | Operating Current Vcc = 5V, TA= 70°C 
Fim | Input HIGH Leakage VIH = VCC 


Ic 850 
IH 
IL 
lo 


100 


| Input LOW Leakage’) Vit = GND —100 
| loz | Output Tri-state Leakage | VoH=VCC,VoL=GND | —100 00 


NOTES: 2873 tbl 09 
1. Vit Min. = —3.0V for pulse width less than 15ns. Vit should not fall below —0.5 Volts for larger periods. 
2. VIHs and VILs apply to CIk2xSys, Clk2xSmp, Cik2xRd, Clk2xPhi, CpBusy, and Reset. 
3. These parameters do not apply to the clock inputs. 
4. VoHT and VoLtT apply to the bidirectional data and tag buses only. Note that ViH and Vit also apply to these signals. VOHT and VoLT are provided 
to give the designer further information about these specific signals. 
. Vi should not be held above Vcc + 0.5 volts. 
. Guaranteed by design. 
7. VoHc applies to RUN and Exception. 
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AC ELECTRICAL CHARACTERISTICS(1,4) et a 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Voc =+5.0V+5%) 
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Load= 25pF 
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NOTES: 1. Alll timings are referenced to 1.5V. 5. Teycis one CPU clock cycle (2 cycles ofa 2xclock). 
2. The clock parameters apply to all three 2xClocks: Clk2xSys, 6. With the exception of Run , no two signals on a given device will 
Clik2xSmp/Rd, and Clk2xPhi. derate for a given load by a difference greater than 15%. 

3. This parameter is guaranteed by design. 7. Transition time <2.5ns for 33MHz; <5ns for lower speeds. 


4. These parameters are illustrated in detail in the /DT79R3001 
Hardware Interface Guide. dee tien se.. 8 
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AC ELECTRICAL CHARACTERISTICS":4) 
COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Voc = +5.0V +5%) 


Clock 


TckHigh | Input Clock HIGH(2) Transition < 2.5ns 
TckLow | Input Clock LOW() Transition < 2.5ns 
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ae) 
wn 


=) 5 
” ” 


12.5 500 
Tcyc/4 
Tcyc/4 


ns 


Clk2xSys to Clk2xSmp/Rd(®) 
Clk2xSmp/Rd to Clk2xPhi() 





wo a} an 
© oO}; oO 


ns 








Run Operation 


(3) 
(3) 
(1:0) 


| 
—_ 
o1 
_) 
” 






b 


ea NI No]: 
or] © : ? 
= 5 = 


= 
on 
=) 
Nn 






ine) 
© 
> 
o” 


ns 


=) 
io) 


Data Hold 


Stall Operation 
| TSAVal Load= 25pF 


TMRdi | Memory Read Initiate Load= 25pF | 
Memory Read Terminate Load= 25pF 


oT 
oe ae oe) 
n 


| | 
Nw w]e 
wn} © 


Oo 








2 


3S i535 
yn | n 


ns 






Nh 


Run Terminate Load= 25pF 


N 
Oo 
a) 
” 


Tomape| DMADriveOn—~—~«:sLoad= 25pF 
TDMAEn Load= 25pF 


Reset Initialization 
Reset Pulse Width 


Reset Pulse Width, Pull-downs 
on Tag 


ns 


G 
Oo 
—_] 
” 


— | sk wo — 

oOo; y : 
o on 
a} =} a ] 






a) 
n 


Bi 
n 


oaiaqiljain 
AyovI nist 
= = = a 
= = 







TRST Tceyc 


TRSTTAG 
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NOTES: 2873 tbl 12 
1. All timings are referenced to 1.5V. 

2. The clock parameters apply to all three 2xClocks: Clk2xSys, Clk2xSmp/Rd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. These parameters are illustrated in detail in the /DT79R3001 Hardware Interface Guide. 

5 

6 






. Tcyc is one CPU clock cycle (2 cycles of a 2x clock). 
. With the exception of Run , no two signals on a given device will derate for a given load by a difference greater than 15%. 
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PIN CONFIGURATIONS 








172-Pin Ceramic Flatpack (Cavity Side View) 
CONMOROTMONSE Sr MW OrNOKrANNTNTNDO =) or 
@ 800 ewe wes Be O38 8050555000005 ARS eee wales Oo 
A ABADADDAADD SSSA 6S S$E55S S655r ARABS ARSAAD LS 
Data21 C4 43 AdrLo2 
Data22 C— AdrLo3 
Data24 L— AdrLo4 
Data25 C— AdrLo5 
Data26 -— AdrLo6 
Data31 “— AdrLo7 
DataP3 C— AdrLo8 
Data27 Ct AdrLo9 
Data23 L— AdrLo10 
-XEn CS AdrLo11 
Data29 C— AdrLo12 
Data30 C— AdrLo13 
Exc AdrLo14 
Cik2xPhi C4 VCC15 
GND7 CS VCC16 
GND6 C— VCC17 
CpCond2 C— GND16 
VvcC7 C— GND17 
voce — VCC18 
en 
eNpe —] IDT79R3001 RiSController ee 
GND3 C— VCC20° 
vccs VCC21 
| vcoc4 Cc VCC22 
. vcoc3 — AdrLo15 
GND2 C— CpCondo. 
GNDi CpCond1 
| Clk2xSys + Resvdi 
CpSyne GND19 
MemWr -— GND20 
AcclTyi “— AdrLo16 
Run CJ Adrlo17 
VvCC2 Co Into za 
vec CS Intd 
Cik2xSmp/Rd C4 Int2 
SysOut C= Int3 
DClk = Int4 
___.1Clk (4 IntS 
‘CpCond3 CpoBusy | 
_ MemRd CH WrBusy 
AccTyo C= AdBusy_ 
Acctly2 C= (== BusError 
DMAStall — 1 129 Reset 
AVUUUVUQUGOQNIAYQOQOGHAAVVOVOWOQIONOGOQOWIGAVUNOQOGOVIVOUE TTT OTUUTT. . 
O SMNr ODO Tr Oh ODAORKRAYM tOOWNANTMNTOMOTMNNTrODWMDrONTM 
SESS VAS eS BOON NAAN NOR AGN GRA AN AN ARNASS Boao ES 
Peepers” © 822° CO ese Ol Ses 2000 Se ee eee ec le. whew 
| Lege OG Ss LO0 > COGs>es Tre | 
NOTE: 


1. AccTyp2 is redefined to be Parity Error if the parity enable option is selected at device initialization. 
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PIN CONFIGURATIONS (Continued) 
144-Pin PGA (Top View) 
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NOTE: 2873 drw 18 

1. AccTyp2 is redefined to be Parity Error if the parity enable option is selected at device initialization. 
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Figure 16. Input Clock Timing 
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Rd/SmpOut O) 
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Tsys 2873 drw 20 


Figure 17. Processor Reference Clock Timing 
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Figure 18. Synchronous Memory (Cache) Timing 
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Figure 19. Memory Write Timing 
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Figure 20. Memory Read Timing 
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Coprocessor Store Coprocessor Load 
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Figure 21. Co-Processor Load/Store Timing 
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Figure 22. Interrupt Timing 


























Phase | 1 2 1 | 2 1 | 2 1 | 2 1 2 1 
SysOut cs iS Tsys Tsys Tsys as 
PhiOut | | . UALR 
Mode | Woycle ee Set T an 
rb Rc Re 
oo CK OK POK__OKPOK OKIDK 
afl Tdh mA Tdh ma Tdh a Tah 
Tds Tds Tds Tds 
Reset 
i 
Figure 23. Mode Vector Initialization TdS 2873 drw 26 
NOTES: 


1. Reset must be negated synchronously; however, it can be asserted asynchronously. Designs must not rely on the proper functioning of SysOut prior to 
the assertion of Reset. 

2. If Phase-Lock On or is asserted as mode select options, they should be asserted throughout the Reset period, to insure that the slowest coprocessor in 
the system has sufficient time to lock to the CPU clocks. 

3. Reset is actually sampled in both Phase 1 and Phase 2. To insure proper initialization, it must be negated relative to the end of Phase 1. 
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Figure 24. Entering DMA Stall 
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Figure 25. Completing DMA Stall 
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ORDERING INFORMATION 





IDT —_XXXXX_ XX XX 
Device Type Speed Package Process/ 
Temperature 
Range a 
| Blank Commercial (0°C to +70°C) 

G —-144-Pin PGA 
F 172-Pin Flat Pack 
16 16.67 MHz 
20 20.0 MHz 
25 25.0 MHz 
33 33.0 MHz 
40 | 40.0 MHz 
79R3001 _RISController . Bienes 





VALID COMBINATIONS 
DT 79R3001- 16,20,25,33 All Packages 


79R3001- 40 | G 
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RISC CPU PROCESSOR 
RISCore™ 


IDT79R3500 








FEATURES: 

Efficient Pipelining—The CPU's 5-stage pipeline design 
assists in obtaining an execution rate approaching one 
instruction per cycle. Pipeline stalls and exceptions are 
handled precisely and efficiently. 

On-Chip Cache Control—The IDT79R3500 provides a 
high-bandwidth memory interface that handles separate 
external Instruction and Data Caches ranging in size from 
4 to 256kBs each. Both caches are accessed during a 
single CPU cycle. All cache control is on-chip. 

* On-Chip Memory Management Unit—A fully-associative, 
64-entry Translation Lookaside Buffer (TLB) provides fast 
address translation for virtual-to-physical memory map- 
ping of the 4GB virtual address space. 

Dynamically able to switch between Big- and Little- Endian 
byte ordering conventions. 

Optimizing Compilers are available for C, FORTRAN, 
Pascal, COBOL, Ada, PL/1 and C++. 

20 through 40MHz clock rates yield up to 32VUPS sus- 
tained throughput. 

Supports independent multi-word block refill of both the 
instruction and data caches with variable block sizes. 


Supports concurrent refill and execution of instructions. 
Partial word stores executed as read-modify-write. 

6 external interrupt inputs, 2 software interrupts, with 
single cycle latency to exception handler routine. 
Flexible multiprocessing support on chip with no impact on 
uniprocessor designs. 

A single chip integrating the R3000 CPU and R3010 FPA 
execution units, using the R3000A pinout. 

Software compatible with R3000, R2000 CPUs and R3010, 
R2010 FPAs. 

TLB disable feature allowing a simple memory model for 
Embedded Applications. 

Programmable Tag bus width allowing reduced cost cache. 
Hardware Support of Single- and Double-Precision Float- 
ing Point Operations that include Add, Subtract, Multiply, 
Divide, Comparisons, and Conversions. 

Sustained Floating Point Performance of 11 MFlops single 
precision LINPACK and 7.3MFLOPS double precision 
Supports Full Conformance With IEEE 754-1985 Floating 
Point Specification 

64-bit FP operation using sixteen 64-bit data registers 
Military product compliant to MIL-STD 833, class B 


IDT79R3500 PROCESSOR 
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DESCRIPTION: 

The IDT79R3500 RISC Microprocessor consists of three 
tightly-coupled processors integrated on a single chip. The 
first processor is a full 32-bit CPU based on RISC (Reduced 
Instruction Set Computer) principles to achieve a new stan- 
dard of microprocessor performance. The second processor 
is a system control coprocessor, called CPO, containing a 
fully-associative 64-entry TLB (Translation Lookaside Buffer), 
MMU (Memory Management Unit) and control registers, sup- 
porting a 4GB virtual memory subsystem, and a Harvard 
Architecture Cache Controller achieving a bandwidth of 
320MBs/second using industry standard static RAMs. The 
third processor is the Floating Point Accelerator which per- 
forms arithmetic operations on values in floating-point repre- 
sentations. This processor fully conforms to the requirements 
of ANSI/IEEE Standard 754-1985, “IEEE Standard for Binary 
Floating-Point Arithmetic.” In addition, the architecture fully 
supports the standard’s recommendations. 

The programmer model of this device will be the same as 
the programmer model of a system which uses a discrete 
IDT79R3000 with the IDT79R3010: 32 integer registers, 16 
floating point registers; co-processor 0 registers; floating point 
status and control register; RISC integer ALU; Integer Multiply 
and Divide ALU; Floating Point Add/Subtract, Multiply, and 
Divide ALUs. The device pipeline will be the same as for the 
IDT79R3000, as will the co-processor 0 functionality. No new 
— instructions have been introduced. Pin compatibility extends 


- to AC and DC characteristics, software execution and initial- - 


.. ization mode vector selection. 

This data sheet provides an overview of the features and 
architecture of the IDT79R3500 CPU, Revision 3.0. A more 
detailed description of the operation of the device is incorpo- 
rated in the A3500 Family Hardware User Manual, anda more 
detailed architectural overview is provided in the M/PS RISC 
Architecture book, both available from IDT. Documentation 
providing details of the software and development environ- 
ments supporting this processor are also available from IDT. 


IDT79R3500 CPU Registers 

The IDT79R3500 CPU provides 32 general purpose 32- 
bit registers, a 32-bit Program Counter, and two 32-bit regis- 
ters that hold the results of integer multiply and divide opera- 
tions. Only two of the 32 general registers have a special 
purpose: register r0 is hardwired to the value “0”, which is a 
useful constant, and register r31 is used as the link register in 
jump-and-link instructions (return address for subroutine calls). 

The CPU registers are shown in Figure 2. Note that there 
is no Program Status Word (PSW) register shown in this 
figure: the functions traditionally provided by a PSW register 
are instead provided in the Status and Cause registers incor- 
porated within the System Control Coprocessor (CPO). 
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FPA REGISTERS 

The IDT79R3010A FPA provides 32 sansa purpose 32- 
bit registers, a Control/Status register, and a Revision Identi- 
fication register. 

Floating-point coprocessor operations reference os types 
of registers: 
¢ Floating-Point Control Registers (FCR) 
¢ Floating-Point General Registers (FGR) 
¢ Floating-Point Registers (FPR) 


General Purpose Registers 


31 0 


Multiply/Divide Registers 


w 
— 
oO 


G@ 
— 
oO 


G 
= 
oO 


re 





: 2871 drw 02 
Figure 2. IDT79R3500 CPU Registers a 


Floating-Point General Registers (FGR) 

There are 32 Floating-Point General Registers (FGR) on 
the FPA. They represent directly-addressable 32-bit regis- 
ters, and can be accessed by Load, Store, or Move Operations. 


Floating-Point Registers (FPR) ; 

The 32 FGRs described in the preceding paragraph are 
also used to form sixteen 64-bit Floating-Point Registers 
(FPR). Pairs of general registers (FGRs), for example FGRO 
and FGR1 (Figure 3) are physically combined to form a single 
64-bit FPR. The FPRs hold a value in either single- or double- 
precision floating-point format. Double-precision formatFPRs 
are formed from two adjacent FGRs. 


Floating-Point Control Registers (FCR) 

There are 2 Floating-Point Control Registers (FCR) on the 
FPA. They can be accessed only by Move operations and 
include the following: 
¢ Control/Status register, used to control and monitor ex- 

ceptions, operating modes, and rounding modes; 
¢ Revision register, containing revision information about 
the FPA. 
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Figure 3. FPA Registers 


Instruction Set Overview 

All |IDT79R3500 instructions are 32 bits long, and there 
are only three instruction formats. This approach simplifies 
instruction decoding, thus minimizing instruction execution 
time. The IDT79R3500 processor initiates a new instruction 
on every run cycle, and is able to complete an instruction on 
almost every clock cycle. The only exceptions are the Load 
instructions and Branch instructions, which each havea single 
cycle of latency associated with their execution. Note, how- 
ever, that in the majority of cases the compilers are able to fill 
these latency cycles with useful instructions which do not 
require the result of the previous instruction. This effectively 
eliminates these latency effects. 

The actual instruction set of the CPU was determined after 
extensive simulations to determine which instructions should 
be implemented in hardware, and which operations are best 
synthesized in software from other basic instructions. This 
methodology resulted in the IDT79R3500 having the highest 
performance of any available microprocessor. 
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Figure 4. IDT79R3500 Instruction Formats 


The IDT79R3500 instruction set can be divided into the 
following groups: 
Load/Store instructions move data between memory and 
general registers. They are all I-type instructions, since the 
only addressing mode supported is base register plus 16- 
bit, signed immediate offset. 
The Load instruction has a single cycle of latency, which 
means that the data being loaded is not available to the 
instruction immediately after the load instruction. The 
compiler will fill this delay slot with either an instruction 
which is not dependent on the loaded data, or with a NOP 
instruction. There is no latency associated with the store 
instruction. 
Loads and Stores can be performed on byte, half-word, 
word, or unaligned word data (32-bit data not aligned ona 
modulo-4 address). The CPU cache is constructed as a 
write-through cache. 
Computational instructions perform arithmetic, logical 
and shift operations on values in registers. They occur in 
both R-type (both operands and the result are registers) 
and |-type (one operand is a 16-bitimmediate) formats. FP 
computational instructions perform arithmetic operations 
on floating point values in the FPA registers. Note that 
computational instructions are three operand instructions; 
that is, the result of the operation can be stored into a 
different register than either of the two operands. This 
means that operands need notbe overwritten by arithmetic 
operations. This results in a more efficient use of the large 
register set. 
Conversion instructions perform conversion operations 
on the floating point values in the FPA registers. 
Compare intructions perform comparisons of the contents 
of FPA registers and set a condition bit based on the 
results. The result of the compare operations is tied 
directly to Cp Cond (1) for software testing. 
Jump and Branch instructions change the control flow of 
aprogram. Jumps are always to apaged absolute address 
formed by combining a 26-bit target with four bits of the 
Program counter (J-type format, for subroutine calls), or 
32-bit register byte addresses (R-type, for returns and 
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ADD.fmt 
SUB.fmt 
MUL.fmt 
DIV.fmt 

ABS.fmt 
MOV.fmt 
NEG.fmt 


C.cond.fmt 


SLL 
SRL 
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Load/Store Instructions 
Load Byte 

Load Byte Unsigned 
Load Halfword 

Load Halfword Unsigned 
Load Word 

Load Word Left 

Load Word Right 

Store Byte 

Store Halfword 

Store Word 

Store Word Left 

Store Word Right 


FPA Load/Store/Move Instructions 
Load Word to FPA 

Store Word from FPA 

Move Word to FPA 

Move Word from FPA 

Move Control word to FPA 

Move Control word from FPA 


Arithmetic Instructions 


(ALU Immediate) 


Add Immediate 

Add Immediate Unsigned 

Set on Less Than Immediate — 
Set on Less Than Immediate 
Unsigned 

AND Immediate 

OR Immediate 

Exclusive OR Immediate 
Load Upper Immediate 


Arithmetic Instructions 


(3-operand, register-type) 


Add 

Add Unsigned 

Subtract 

Subtract Unsigned 

Set on Less Than 

Set on Less Than Unsigned 


~ AND 


OR 
Exclusive OR 
NOR 


FPA Computational Instructions 
Floating point Add 

Floating point Subtract 

Floating point Multiply 

Floating point Divide 
Floating-point Absolute value 
Floating point Move 

Floating point Negate 


FPA Compare Instructions 
Floating-point Compare 


Shift Instructions 
Shift Left Logical 
Shift Right Logical 


IDT79R3500 Instruction Summary 
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SRA 
SLLV 
SRLV 
SRAV 


CVT.S.fmt 
CVT.D.fmt 
CVT.W.fmt 


MULT . 
MULTU 
DIV 
DIVU 
MFHI 
MTHI 
MFLO 
MTLO. 


J 

JAL 
JR 
JALR 
BEQ 


BNE 


BLEZ 
BGTZ | 
BLTZ 
BGEZ 


BLTZAL 


~ BGEZAL 
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SYSCALL 
BREAK 


LWCZ 
SWCZ 
MTCZ 
MFCZ 
CTCZ 
CFCZ 
COPZ 
BCZT 
BCZF 


Shift Instructions (Cont.) 
Shift Right Arithmetic 

Shift Left Logical Variable 
Shift Right Logical Variable 
Shift Right Arithmetic Variable 


FPA Conversion Instructions 

Floating point Convert to Single FP 
Floating point Convert to Double FP 
Floating point Convert to fixed point 


Multlply/Divide Instructions 
Multiply 

Multiply Unsigned 

Divide 

Divide Unsigned 

Move From HI 

Move To HI 

Move From LO 

Move To LO 


Jump and Branch Instructions 
Jump 


_ Jump and Link 


Jump to Register 

Jump and Link Register - 
Branch on Equal 

Branch on Not Equal - 
Branch on Less than or Equal to Zero 
Branch on Greater Than Zero 
Branch on Less Than Zero 

Branch on Greater than or 

Equal to Zero a 
Branch on Less Than Zero and Link 


- Branch on Greater than or Equal to 


Zero and Link 


Special Instructions 
System Call 
Break 


Coprocessor Instructions 
Load Word from Coprocessor. 
Store Word to Coprocessor 
Move To Coprocessor 

Move From Coprocessor . 

Move Control to Coprocessor 
Move Control From Coprocessor 
Coprocessor Operation 

Branch on Coprocessor z True 
Branch on Coprocessor z False 


System Control Coprocessor — 
(CPO) Instructions 

Move To CPO 

Move From CPO 

Read indexed TLB entry 

Write Indexed TLB entry 

Write Random TLB entry 

Probe TLB for matching entry 
Restore From Exception 
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dispatches). Branches have 16-bit offsets relative to the 
program counter (I-type). Jump and Link instructions save 
a return address in Register 31. The R3500 instruction set 
features a number of branch conditions. Included is the 
ability to compare a register to zero and branch, and also 
the ability to branch based on a comparison between two 
registers. Thus, net performance is increased since soft- 
ware does not have to perform arithmetic instructions prior 
to the branch to set up the branch conditions. 

¢« Coprocessor instructions perform operations in the 
coprocessors. Coprocessor Loads and Stores are I-type. 

* Coprocessor 0 instructions perform operations on the Sys- 
tem Control Coprocessor (CPO) registers to manipulate 
the memory management and exception handling facilities 
of the processor. 

* Special instructions perform a variety of tasks, including 
movement of data between special and general registers, 
system calls, and breakpoint. They are always R-type. 


Table 1 lists the instruction set of the IDT 79R3500 processor. 


IDT79R3500 System Control Coprocessor (CPO) 

The IDT79R3500 can operate with up to four tightly- 
coupled coprocessors (designated CPO through CP3). The 
System Control Coprocessor (or CPO), is incorporated on the 
IDT79R3500 chip and supports the virtual memory system 
and exception handling functions of the IDT79R3500. The 
virtual memory system is implemented using a Translation 
Lookaside Buffer and a group of programmable registers as 
shown in Figure 5. 3 


System Control Coprocessor (CPO) Registers 

The CPO registers shown in Figure 5 are used to control 
the memory management and exception handling capabilities 
of the IDT79R3500. Table 2 provides a brief description of 
each register. 
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SYSTEM CONTROL COPROCESSOR (CP0) 
INSTRUCTIONS 


EntryHi 
EntryLo 
Index 
Random 


Status 
Cause 
EPC 
Context 
BadVA 


PRid 


ENTRYHI | ENTRYLO 


NOT ACCESSED 
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High half of a TLB entry 

Low half of a TLB entry 
Programmable pointer into TLB array 
Pseudo-random pointer into TLB array 


Mode, interrupt enables, and diagnostic status info 
Indicates nature of last exception 

Exception Program Counter 

Pointer into kernel’s virtual Page Table Entry array 
Most recent bad virtual address 


Processor revision identification (Read only) 
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Memory Management System oa | 
The IDT79R3500 has an addressing range of AGB. How- 
ever, since most IDT79R3500 systems implement a physical 
memory smaller than 4GBs, the |DT79R3500 provides for the 
logical expansion of memory space by translating addresses 
composed in a large virtual address space into available 
physical memory address. Two TLB modes are supported. 
When the TLB is used, the 4GB address space is divided into 
2GBs which can be accessed by both the users and the 
kernel, and 2GBs for the kernel only. Virtual addresses within 
the kernel/user segment are translated to physical addresses 
ona 4kB page basis. This mode is typical of UNIX and other 


sophisticated operating systems. When the TLB is disabled, 
mapping is locked as 2GBs as kernel/user, and 1.5GBs as. 


kernel only. This mode requires no TLB manipulation, pro- 


vides large linear address space, and is typical for embedded 


applications. 


TLB (Translation Lookaside Buffer) 

Virtual memory mapping is assisted by the Translation 
Lookaside Buffer (TLB). The on-chip TLB provides very fast 
virtual memory access and is well-matched to the require- 
ments of multi-tasking operating systems. The fully-associa- 
tive TLB contains 64 entries, each of which maps a 4kB page, 
with controls for read/write access, cacheability, and process 
identification. The TLB allows each user to access up to 2GBs 


_ of virtual address space. 


Figure 6 illustrates the format of each TLB entry. The 


- Translation operation involves matching the current Process 


44 43 3837) 


32 31 


ID (PID) and upper 20 bits of the address against PID and VPN 
(Virtual Page Number) fields in the TLB. When both match (or 
the TLB entry is Global), the VPN is replaced with the PFN 
(Physical Frame Number) to form the physical address. | 

TLB misses are handled in software, with the entry to be re- 
placed determined by as imple RANDOM function. The rou- 
tine to process a TLB miss in the UNIX environment requires 
only.10-12 cycles, which compares favorably with ERY, nee 
which penoiny the operation in parowalies ; 


TLB Disabled Operation qe 

Many embedded systems do. not like the: complexity or 
uncertainty associated with the on-chip TLB. However, many 
systems still desire the. ability to implement a kernel/user 
mode. Therefore, to implement a hierachical task model, the 
TLB must be used. The R3500 gives the system designer one 
more option, allowing the TLB to be disabled and performing 
a fixed mapping of virtual to physical addresses, while main- 
taining separation of kernel and user resources. 

The user may elect to disable the TLB through the reset 
sectors. In this case, the mapping shown in Figure 8. is used, 
and device power consumption is reduced. Note tha “cached“ 
segments means: that there is no mechanism to exclude 
addresses in these regions from the cache. | | 

This mapping means that applications designed to run in. 
ksegO and kseg1 (to avoid the TLB) can use the R3500, 
disable the TLB to reduce power, and not have to change 
software to take advantage of this new feature. 





ENTRYHI 


ENTRYLO 


VPN — Virtual Page Number 


TLBPID — Process ID 


PFN — Physical Frame Number 
N — Non-cacheable flag 
D — Dirty flag (Write protect) 


V — Valid entry f 


lag 
G — Global flag (ignore PID) 


O — Reserved 
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Figure 6. TLB Entry Format 
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OxFFFFFFFF 


KERNEL 
MAPPED 
CACHEABLE 
(kseg2) 


0xCO0000000 


KERNEL 
UNMAPPED 
UNCACHED 
(kseg1) 
OxA0000000 


KERNEL 
UNMAPPED 
CACHED 
(ksegO) 
0x80000000 
Ox7FFFFFFFF 


KERNEL/USER 
MAPPED 
CACHEABLE 
(kuseg) 
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OxFFFFFFFF 


PHYSICAL 
MEMORY as 


0x20000000 
Ox1FFFFFFF 


MEMORY 512 MB 
0x00000000 
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Figure 7. IDT79R3500 Virtual Address Mapping 


Oxtfffffff 
Kernel Cached 

(kseg2) 
Oxc0000000 
Kernel Uncached 
Oxa0000000 (kseg!) 


Kernel Uncached 


(ksegO) 


0x80000000 


User Cached 
(kseg) 


0x00000000 
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(TBL Disabled) 





Kernel Cacheabl 
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Kernel/User 


Cacheable 
Taake 2048 MB 


Inaccessible 512 MB 


Kernel Boot 
and I/O 912 MB 
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NOTE: This model is consistent with the mapping available in the IDT79R3051 family. The identical mapping provides software compatibility to the 


lower cost CPUs. 


Figure 8. TLB Disabied Mapping 
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Operating Modes 
The IDT79R3500 has two operating modes: ‘User mode 
and Kernel mode. The IDT79R3500 normally operates in the 
User mode until an exception is detected forcing it into the 
Kernel mode. It remains in the Kernel mode until a Restore 
From Exception (RFE) instruction is executed. The manner in 
which memory addresses are translated or mapped depends 
onthe operating mode of the IDT79R3500. Figure 7 shows the 
MMU translation performed for each of the operating modes. 
User Mode—in this mode, a single, uniform virtual address 
space (kuseg) of 2GB is available. When the TLB is used, 
each virtual address is extended with a 6-bit process identifier 
field to form unique virtual addresses. All references to this 
segment are mapped through the TLB. Use of the cache for 


up to 64 processes is determined by bit settings for each page 


within the TLB entries. If the TLB is not used, these addresses 

are translated to begin at 1GB of the physical address space. 
Kernel Mode—four separate segments are defined in this 

mode: 

* kuseg—when in the kernel mode, references to this seg- 
ment are treated just like user mode references, thus 
streamlining kernel access to user data. 


¢ ksegO—references to this 512MB segment use cache 


memory but are not mapped through the TLB. Instead, 
they always map to the first 0.5GB of physical address 
space. 

¢ ksegi—references to this 512MB segmentare not mapped 
through the TLB and do not use the cache. Instead, they 
are hard-mapped into the same 0.5GB segment of physi- 
cal address space as kseg0. 

* kseg2—when the TLB is not used, references to this 1GB 
segment directly addresses the upper 1GB of physical 
address space. These addresses are defined to be kernel 
mode which are cacheable. When the TLB is used, refer- 
ences to this 1GB segment are always mapped through 
the TLB and use of the cache is determined by bit settings 
within the TLB entry. 


FPA COPROCESSOR OPERATION (CP1) 


The FPA continually monitors the processor instruction 
stream. If an instruction does not apply to the coprocessor, it 


is ignored; if an instruction does apply to the coprocessor, the ~ 


FPA executes that instruction and transfers necessary result 

and exception data synchronously to the main processor. 
The FPA performs three types of operations: 

¢ Loads and Stores; 

¢ Moves; 

¢ Two- and three- -register floating-point operations. 
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Load, Store, and Move Operation 
Load, Store, and Move operations data pennant memory 


or the integer registers and the FPA registers. These opera- 


tions perform no format conversions and cause no floating- 
point exceptions. Load, Store, and Move operations reference 
a single 32-bit word of either the Floating-Point General 
Registers (FGR) or the Floating-Point Control Registers (FCR). 


Floating-Point Operations 
The FPA supports the following single- and double-preci- 
sion format floating- point operations: 
* Add — 
¢ Subtract 
¢ Multiply 
¢ Divide 
¢ Absolute Value 


_ ¢ Move 


¢ Negate 
¢ Compare 

In addition, the FPA supports conversions between single- 
and double-precision floating-point formats and fixed-point 
formats. 

The FPA incorporates separate Add/Subtract, Multiply, 
and Divide units, each capable of independent and concurrent 
operation. Thus, to achieve very high performance, floating 
point divides can be overlapped with floating point multiplies 
and floating point additions. These floating point operations 
occur independently of the actions of the CPU, allowing 
further overlap of integer and floating point operations. Figure 
9 illustrates an example of the types of overlap permissible. 


Exceptions 
The FPA supports all five IEEE standard exceptions: 
¢ Invalid Operation 
¢ Inexact Operation 
¢ Division by Zero 
¢ Overflow 
¢ Underflow 
The FPA also Supposne the aBtinal Unimplemented 
Operation exception that allows unimplemented instructions 


_to trap to software emulation routines. | 


The FPA provides precise exception capability to the CPU; 
that is, the execution of a floating point operation which 
generates an exception causes that exception to occur at the 


CPU instruction which caused the operation. This precise 


exception capability is a requirement in applications and 
languages which provide a mechanism for local software 


exception handlers within software modules. 
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these cycles. However, two multiply or two 
divide operation cannot be overlapped. 
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Figure 9. Examples of Overlapping Floating Point Operation 
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Figure 11. IDT79R3500 Execution Sequence 
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IDT79R3500 PIPELINE ARCHITECTURE 


The execution of a single IDT79R3500 integer instruction 
consists of five pipe stages while floating point instruction 


takes six pipe stages. They are: | . 
1) IF—Instruction fetch. The processor calculates the 


instruction address required to read from the | cache.» 
2) RD—The instruction is present on the data bus during 


phase one of this pipe stage. Instruction decode occurs 
during phase two. Operands are read from the registers if 
required. 

3) ALU—Perform the required operation on instruction 
operands. If this is a FPA instruction, instruction execution 
commences. 7 _ ts oe 

4) MEM—Access memory. If the instruction is a load or 
store, the data is presented or captured during phase 2 of 
this pipe stage. - 

5) WB—Write integer results back into register file. In 
FPA cycles this pipe stage is used for exceptions. 

6) FWB—The FPA uses this stage to write back ALU 
results to its register file. 

Each of these steps requires approximately one FPA cycle 
as shown in Figure 10. (parts of some operations spill over into 
another cycle while other operations require only 1/2 cycle.) 

The CPU uses a five stage pipeline while while the FPA 


uses a 6 stage to achieve an instruction execution rate. 


approaching one instruction per cycle. Thus, execution of six 
instructions at a time are overlapped as shown in Figure 11. 

This pipeline operates efficiently because different CPU 
resources (address and data bus accesses, ALU operations, 


register accesses, and so on) are utilized on a non-interfering. 


basis. 


MEMORY SYSTEM HIERARCHY 

The high performance capabilities of the IDT79R3500 
processor demand system configurations incorporating tech- 
niques frequently employed in large, mainframe computers 
but seldom encountered in systems based on more traditional 
microprocessors. | 

Aprimary goal of systems employing RISC techniques is to 
minimize the average number of cycles each instruction 
requires for execution. Techniques to reduce cycles-per- 
instruction include a compact and uniform instruction set, a 
deep instruction pipeline (as described above), and utilization 
of optimizing compilers. Many of the advantages obtained 
from these techniques can, however, be negated by an 
inefficient memory system. | | 

Figure 12 illustrates memory in a simple microprocessor 
system. In this system, the CPU outputs addresses to memory 
and reads instructions and data from memory or writes data to 
memory. The address space is completely undifferentiated: 
instructions, data, and I/O devices are all treated the same. In 
suchasystem, aprimary limiting performance factor is memory 
bandwidth. , 
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Microprocessor 


Data Address 


Memory 
(and |/O) 
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Figure 12. A Simple Microprocessor Memory System 


Figure 13 illustrates a memory system that supports the 
significantly greater memory bandwidth required to take full 
advantage of the IDT79R3500’s performance capabilities. 
The key features of this system are: 


IDT79R3500. 
Microprocessor 
Address 


| 
Cache | 
= Data _ 
Cache 


Write | 
Buffer 


en. heen 


Address 


Main Memory 
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Figure 13. An IDT79R3500 System with a 
High-Performance Memory System 
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¢ External Cache Memory—Local, high-speed memory 
(called cache memory) is used to hold instructions and 
data that is repetitively accessed by the CPU (for example, 
within a program loop) and thus reduces the number of 
references that must be made to the slower-speed main 
memory. Some microprocessors provide a limited amount 
of cache memory on the CPU chip itself. The external 
caches supported by the IDT79R3500 canbe much larger; 
while a small cache can improve performance of some 
programs, significant improvements for a wide range of 
programs require large caches. 

* Separate Caches for data and Instructions—Even with 
high-speed caches, memory speed can still be a limiting 
factor because of the fast cycle time of a high-performance 
microprocessor. The IDT79R3500 supports separate 
caches for instructions and data and alternates accesses 
of the two caches during each CPU cycle. Thus, the 
processor can obtain data and instructions at the cycle rate 
of the CPU using caches constructed with commercially 
available IDT static RAM devices. 

In order to maximize bandwidth in the cache while minimiz- 
ing the requirement for SRAM access speed, the 
IDT79RR3500 divides a single-processor clock cycle into 
two phases. During one phase, the address for the data 
cache access is presented while data previously ad- 
dressed in the instruction cache is read; during the next 
phase, the data operation is completed while the instruc- 
tion cache is being addressed. Thus, both caches are read 
in a single processor cycle using only one set of address 
and data pins. 


¢ Write Buffer—in order to ensure data consistency, all data 


that is written to the data cache must also be written out to 
main memory. The cache write model used by the 


IDT79R3500 is that of a write-through cache; that is, all. 


data written by the CPU is immediately written into the 
main memory. To relieve the CPU of this responsibility 
(and the inherent performance burden) the IDT79R3500 
supports an interface to a write buffer. The IDT79R3020 
Write Buffer captures data (and associated addresses) 
output by the CPU and ensures that the data is passed on 
to main memory. 


IDT79R3500 Processor Subsystem Interfaces 


Figure 14 illustrates the three subsystem interfaces pro- 


vided by the IDT79R3500 processor: 


* Cache control interface (on-chip) for separate data and 
instruction. caches permits implementation of off-chip 
caches using standard IDT SRAM devices. The 
IDT79R3500 directly controls the cache memory with a 
minimum of external components. Both the instruction and 


data cache can vary from 0 to 256kB (64K entries). The 


IDT79R3500 also includes the TAG control logic which 
determines whether or not the entry read from the cache 
is the desired data. The IDT79RR3500 implements an 
advanced feature that allows certain tag comparisons to 
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be eliminated, which in turn reduces the number of cache 
SRAMSs required. The Int(5) reset mode vector contains 
two bits which sets the tag comparison options. Table 3 
illustrates the tag disable encoding. The first row in the 
table implements the standard IDT79R3000A operating 
mode where all the tag and tag parity are used. The 
second row eliminates the upper 4 tag bits, eliminating 
normally required SRAMs and limiting main memory ad- 
dressing to 128mB. The third row elimnates the lower 4 tag 
bits, which requires the cache to be at least 64kB each. 
The fourth row eliminates the upper 4 and lower 4 tag bits, 
requiring at least 16K cache entries, and limits main 
memory addressing to 128mB. Inallcases, the IDT79R3500 
continues to check tag parity which are selected as driven 
from the cache. The IDT79R3500 cache controller imple- 
ments a direct mapped cache for high net performance 
(bandwidth). It has the ability to refill multiple words when 
a cache miss occurs, thus reducing the effective miss rate 
to less than 2% for large caches. When a cache miss 
occurs, the IDT79R3500 can support refilling the cache in 
1, 4, 8, 16, or 32 word blocks to minimize the effective 
penalty of having to access mainmemory. The IDT79R3500 
also incorporates the ability to perform instruction stream- 
ing; while the cache is refilling, the processor can resume 
execution once the missed word is obtained from main 
memory. Inthis way, the processor can continue to execute 
concurrently with the cache block refill. 


¢ Memory controller interface for system (main) memory. 


This interface also includes the logic and signals to allow 
operation with a write buffer to further improve memory 
bandwidth. In addition to the standard full word access, the 
memory controller supports the ability to write bytes and 
half-words by using partial word operations. The memory 
controller also supports the ability to retry memory ac- 
cesses if, for example, the data returned from memory is 
invalid and a bus error needs to be signalled. 


¢ Coprocessor Interface—The IDT79R3500 features a set 


of on board tightly coupled coprocessors. Coprocessor 0 
is defined to be the system control coprocessor and 
Coprocessor 1 is the Floating Point Accelerator. They 
have direct access to the internal data bus which allows 
them direct load and store of data in the same fashion as 
accessing the CPU registers. This relieves the typical 
bottleneck of having to load data into the CPU register set 
and then passing that data off to the co-processors. 


In applications where the FPA was off chip, as in using the 


IDT79R3010A, several control pins were used for commu- 
nications with the CPU and a Phase Lock Loop was 
located on the IDT79R3010A to synchronize the two 
together. As they are now integrated into a single chip, 
these are no longer needed. The FpCond output, which is 
used in coprocessor branch instructions, is now internally 
tied to the CoCond(1) input of the CPU leaving the external 
CpCond(1) pin available for another function. This signal 
is selectable to either output the FpBusy or the Fpint. Cp 
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Cond(1) output selection is determined at reset time ac- 
cording to the value read on Int(4). Table 4 illustrates the 
- options that allow the Fplnt to be routed to either the 
CpCond(1) output, or one of the internal Int pins. If it is 
internally routed, that interrupt is dedicated and that input 
will no longer affect the IDT79R3500. The. selection of 
using CpCond(1) allows some external Logic to be added 
to the path, which might be required in some applications. 
Another method for Fpint handling is also accommodated. 
A mode pin, previously Vcc can be programmed to route 
the FPU interrupt to a dedicated Fpint output that was 


Tag 
Mode 1 






. 


Tag Check | ~—iIgnore— 
Mode 0 Which TAGs Which Tags 







> | tas | re | 
1 [ser [roo ra | 
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Table 3. Tag Disable Encoding 


7 





wi xf sey | 
ycle | Cycle | Cy | 

| x | x | x |eHigH"| FPint driven onto CpCond(1) 
LOW" Use Int(3) for Fpint 


x< 










L ) 

) 
cow’ 
LOW" 
LOW" 
) 
'L 


LOW" | "LOW" "HIGH" | "LOW" | Use Int(1) for Fpint 


U 
'HIGH"| "LOW" |"HIGH" | "LOW" | Use Int(5) for Fpint 

'HIGH"| "HIGH" |"LOW" Reserved, Undefined 
'HIGH" | "HIGH" "HIGH" | "LOW" | Reserved, Undefined 
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- Table 4. Int(4) Encoding for Fpint 
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previously a GND. Ifthe mode pin.is sampled at reset as 
a 0, the dedicated Fpint indicates the FPU interrupt - if a 1, 

_ then the routing of Table 4 applies. S Pa." 
The internal CPBusy input, which is used to stall the CPU 
if. the coprocessor needs to-hold off subsequent opera- 
tions, has two sources-FPBusy and the external CpoBusy 

~ pin which. are logically ORed together. Further, Run and 
Exception of both the FPA and CPU are internally tied and 
brought out with the external CPBusy input to accommo- 
date off chip coprocessor 2 and 3. This external interface 
is available to support application specific functions. 


MULTIPROCESSING SUPPORT 


~ The IDT79R3500 supports multiprocessing applications 
in a simple but effective way. Multiprocessing applications 
require cache coherency across the multiple processors. The 
IDT79R3500 offers two signals to support cache coherency: 
the first, MPStall, stalls the processor within two cycles of 
being received and keeps it from accessing the cache. This 
allows an external agent to snoop into the processor data 
cache. The second signal, MPInvalidate, causes the proces- 
sor to write data on the data cache bus which indicates the 
externally addressed cache entry is invalid. Thus, a subse- 
quent access to that location would result in a cache miss, and 
the data would be obtained from main memory. . 

The two MP signals would be generated by a external logic 
which utilizes a secondary cache to perform bus snooping 
functions. The IDT79R3500 does not impose an architecture 
for this secondary cache, but rather is flexible enough to 
support a variety of application specific architectures.and still 
maintain. cache coherency. Further, there is no: impact on 
designs which do not require this feature. The IDT79R3500 
further allows the use of cache RAMs with internal address 
latches in multiprocessor systems. | -— 


ADVANCED FEATURES | 
The IDT79R3500 offers a number of additional features 
such as the ability to swap the instruction and data caches, 
facilitating diagnostics and cache flushing. Another feature 
isolates the, caches, which forces cache hits to occur regard- 
less of the contents of the tag fields. The IDT79R3500 allows 
the processor to execute user tasks of the opposite byte 
ordering (endianness) of the operating system, has a pro- 
grammable Tag width bus, and further allows certain parity 
checking to be disabled. More details on these features canbe 
found in the /DT79R3000A Family Hardware User's Manual. 
Further features of the IDT79R3500 are configured during 
the last four cycles prior to the negation of the RESET input. 
These functions include the ability to select cache sizes and 
cache refill block sizes; the ability to utilize the multiprocessor 
interface; whether or not instruction streaming is enabled; 
whether byte ordering follows “Big-Endian” or “Little-Endian” 
protocols, etc. Additionally, the IDT79R3500 mode must be 
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true to enable any of the new features that the X,Y, and Z 
cycles define. Table 6 shows the configuration options 
selected at Reset. These are further discussed in the 
IDT79R3000A Family Hardware User's Manual. 


BACKWARD COMPATIBILITY 

Theprimary goal of the |IDT79R3500 is the ability to replace 
the IDT79R3000A and IDT79R3010A with a single chip solu- 
tion. The pinout of the IDT79R3500 has been selected to 
ensure this compatibility, with new functions mapped onto 
previously used pins. The instruction set is compatible with 
that of the R2000 at the binary level. As a result, code written 
for the older processor can be executed. 

In most IDT79R3000A applications, the IDT79R3500 can 
be placed in the socket with no modification to initialization 
settings. Additionally, the IDT79R3500 can be used in sys- 
tems that did not include the IDT79R3010 in the original 
design. Further application assistance on these topics are 
available from IDT. 


PACKAGE THERMAL SPECIFICATIONS 


The IDT79R3500 utilizes special packaging techniques to 
improve both the thermal and electrical characteristics of the 
microprocessor. 

In order to improve the electrical characteristics of the 
device, the package is constructed using multiple signal 
planes, including individual power planes and ground planes 
to reduce noise associated with high-frequency TTL parts. In 
addition, the 161-pin PGA package utilizes extra power and 
ground pins to reduce the inductance from the internal power 
planes to the power planes of the PC Board. 

In order to improve the thermal characteristics of the 
microprocessor, the device is housed using cavity down 






















Tnto DBIkSize0 DBIkSize1 
Int? IBIkSizeO IBIkSize4 

Int2 DispPar/RevEnd [Stream 

Int3 Reserved(1) StorePartial 
Int4 FPINT decode FPINT decode 


Int5 7R3500 mode TLB disable 
NOTES: 


1. Reserved entries must be driven high. 


2. These values must be driven stable throughout the enfire RESET period. 
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packaging. In addition, these packages incorporate a copper- 
tungsten thermal slug designed to efficiently transfer heat 
from the die to the case of the package, and thus effectively 
lower the thermal resistance of the package. The use of an 
additional external heat sink affixed to the package thermal 
slug further decreases the effective thermal resistance of the 
package. 

The case temperature may be measured in any environ- 
ment to determine whether the device is within the specified 
operating range. The case temperature shouid be measured 
at the center of the top surface opposite the package cavity 
(the package cavity is the side where the package lid is 
mounted). | | 

The equivalent allowable ambient temperature, TA, can be 
calculated using the thermal resistance from case to ambient 
(@ca) for the given package. The following equation relates 
ambient and case temperature: 


TA = Ic-P*@ca 


where P is the maximum power consumption, calculated by 
using the maximum Icc from the DC Electrical Characteristics 
section. 

Typical values for @ca at various airflows are shown in 
Table 5 for the various CPU packages. 


a ies 


} 0 | 200 | 400 | 600 | 800 | 1000 


Table 5. R3500 Package Characteristics 














2871 tb! 05 




























Extend Cache Big Endian 
MPAdrDisable TriState 
IgnoreParity NoCache 
MultiProcessor BusDriveOn 

FPINT decode FPINT onto CpCond 
Tag Mode 1 Tag Mode 0 
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Table 6. R3500 Mode Selectable Features 


5.3 13 





IDT79R3500 RISC CPU PROCESSOR RISCore | MILITARY AND COMMERCIAL TEMPERATURE RANGES 









Databus. ~ ~~ Data Bus a a ~ Data Bus 


IDT79R3500A Processor 
_ with System Control 


Data Tag lAdr ___—_-- Coprocessor DAdr Tag Data 
[15:2] a [15:2] | 


Instruction __ | 1 3 Data 
Cache Cache 


Cik2xSmp 


Clk2xPhi 
a Reset 
~ Memory — — 7 : CpSync 
Interface — : Run | 
ee | Exe 
| ‘CpBusy 
CpCond(0) CpCond[2:3] 
BusError int[5:0] 


Hardware 
Interrupts 2871 drw 14 


Figure 14. IDT79R3500 Subsystem Interfaces Example; 64 KB Caches 
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PIN CONFIGURATION 
11 
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vec | Dats Date type Tag30 
5 [= 


NOTE: 2871 drw 16 
1. AdrLo 16 and 17 are multifunction pins which are controlled by mode select programming on interrupt pins at reset time 

AdrLo 16: MP Invalidate, CpCond (2). 

AdrLo 17: MP Stall, CpCond (3). 
2. This package is pin-compatible with the 175-pin PGA for the R3000A. 
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PIN CONFIGURATION 
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WrBusy C— 
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Int(5) C3 
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Int(3) 
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NOTE: 
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. AdrLo 16 and 17 are multifunction pins which are controlled by mode select t programming on intehiank pins at reset time 


AdrLo 16: MP Invalidate, CpCond (2). 
AdrLo 17: MP Stall, CpCond (3). 


2. This package is pin-compatible with the 175-pin PGA for the R3000A. 
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PIN CONFIGURATION 
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PIN DESCRIPTIONS te ee de 
| PinName [vo] escription 


Data (0-31) VO | A 32-bit bus used for all instruction and data transmission among the processor, caches, memory interface, and 


coprocessofs. 
DataP (0-3) 


A 4-bit bus containing even parity over the data bus. _ ,. sa : 
A 20-bit bus used for transferring cache tags and high addresses between the processor, caches, and memory interface. | 
| TagV [VO | Thetagvaldityindicator 
jag P(0-2) [VO | A Seitbus containing even parity overthe concatenation of TagVandTag SS 















AdrLo (0-17) An 18-bit bus containing byte addresses used for transferring low addresses from the processor to the caches and memory 
; ), AdrLo 17: CpCond (3) set by reset initialization). | 
IRd1 : Read enable for the instruction cache 


Write enable for the instructon cache 


a 
a. 
DS) 


An identical copy of IRd1 used to split the load. 
An identical copy of Wr1 used to split the load. 
The instruction cache address latch clock. This clock runs continuously. 


interface. (AdrLo 16: CpCond (2 


O|= 
* |x 


O 
D 
ar 


The read enable for the data cache 


The write enable for the data cache. 
An identical copy of DRd1 used to split the load. 





S 
= 








An identical copy of DWr1 used to split the load 


DClk | The data cache address latch clock. This clock runs continuously. 
XEn The read enable for the Read Buffer 


> 
° 
i?) 
— 
< 
me} 
— 
(jo) 


oo 
=| 2 
NOL} PO 


NO 


A 3-bit bus used to indicate the size of data being transferred on the data bus, whether or not a data transfer is occurring 


and the purpose of the transfer. 


Signals the occurrence of a main memory write. 


= 
0} 
: 





MemRd Signals the occurrence of a main memory read 
BusError Signals the occurrence of a bus error during a main memory read or write. 
Run Indicates whether the processor isin the RUN or STALL state. In the discrete design, the R3000 Run output is tied directly 










to the R3010 Run input. In theIDT79 R3500, this is done internally, but the Run signal is also brought out for application 
specific coprocessors. 


Indicates that the instruction that is about to commit to a state change should be aborted; also indicates other exception 
related information. In the discrete design, the R38000 Exception output is tied to the IDT79R3010 Exception input. In the 
IDT79R3500 this is done internally, but the Exception signal is also brought out for application specific coprocessors. 


Exception 
Aclock which is identical to SysOut and used by external coprocessors for timing synchronization with the IDT79R3500. 
Inthe discrete design, CpSync output from the IDT79R3000 is tied to the IDT79R3010 FPSync input. Inthe IDT79R3500, 


Cpsyne 
| | this is done internally, but the CpSync signal is also brought out for application specific coprocessors. 


RdBusy - The main memory read stall termination signal. In most system designs RdBusy is normally asserted and is deasserted 
only to indicate the successful completion of a memory read. RdBusy is sampled by the processor only during memory 






















read stalls. 


|WrBusy | ‘it | The main memory write stall initiation/termination signal. | 











Input used to indicate that the requested coprocessor resource is unavailable, or used to preserve the precise exception 
model. In the descrete design, CoBusy is driven directly by the R301 0 FpBusy output. Inthe IDT79R3500 the CpBusy input 
of the CPU is the logical OR of both the internal FPA FpBusy and the external CpBusy pin. This input is provided for 
external application specific coprocessors. An internal pull down resistor is provided if this input is left open. 


Signal used by the branch on Coprocessor 1 true/false instruction. In discrete systems using a IDT79R3010 FPA, this is 
normally tied to the FpCond output. In the IDT79R3500, the internal FoCond is directly tied to the internal CpCond(1) input 
leaving this pin available for other functions. This pin defaults to output the FpBusy internal signalor, (via the Reset vectors), 
output the FPInt—in the latter case, external hardware must route this signal to the appropriate Int pin. 


Conditional branch status from coprocessors to the processor. Function is provided on AdrLo 16/17 pins and is selected 
at reset time. 


Multiprocessing Stall. Signals to the processor thatit should stall accesses to the caches in a multiprocessing environment. 
This is physically the same pin as CpCond3; its use is determined at RESET initialization. 


Multiprocessing Invalidate. Signals to the processor that it should issue invalidate data on the cache data bus. The address 
to be invalidated is externally provided. This is the same pin as CpCond2; its use is determined at RESET initialization. 


A 6-bit bus used by the memory interface and coprocessors to signal maskable interrupts to the IDT79R3500. This bus 
is also used at reset time to select among the mode-selectable features of the IDT79R3500. The FPA FP Int output signal 














me? 
CpCond(1) : 


CpCond (0,2-3) 












MPStall 


| MPinvalidate fs 








Int (0-5) 


is typically connected to one of these interrupt lines; the choice is programmable through the reset vectors with the default 
being Int(3). 
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PIN DESCRIPTIONS (Continued) 















Clk2xSys 
Clk2xSmp 


Clk2xRd 
Clk2xPhi 







leading edge of SysOut. 


ir 
Retry 
. 
O 





ABSOLUTE MAXIMUM RATINGS": 9) 
ISymbo!|___ Rating _| Commercial| Military _| 


Terminal Voltage 






VTERM —0.5to +7.0 | —0.5 to +7.0 










with Respect 
to GND 

Operating 
Temperature 





0 to +70 
(Ambient) 


0 to +900) 






—55 to +125 
(Case) 















TBIAS Case Temperature|—55 to +125 | -65 to +135 | °C 
_ | Under Bias 0 to +906) 
TSTG Storage —55 to +125 | -65to +155] ° 
Temperature 
lin Input Voltage -—0.5to+7.0 |-0.5to+7.0] V 





NOTES: 2871 tbl 08 

1. Stresses greater than those listed under ABSOLUTE MAXIMUM RATINGS 
may cause permanent damage to the device. This is a stress rating only 
and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 
implied. Exposure to absolute maximum rating conditions for extended 
periods may affect reliability. 

2. ViINminimum = —3.0V for pulse width less than 15ns. 
Vin should not exceed Vcc +0.5 Volts. 

3. Notmore than one outputshould be shorted at atime. Duration of the short 

should not exceed 30 seconds. 
. 16-33 MHz only. 
. 37-40 MHz only. 


as 


The master double frequency input clock used for generating SysOut. 


A double frequency clock input used to determine the sample point for data coming into the processor and 
coprocessors. 


A double frequency clock input used to determine the enable time of the cache RAMs. 


A double frequency clock input used to determine the position of the internal phases, phase1 and phase2. 


Synchronous initialization input used to force execution starting from the reset memory address. Reset must be 
deasseted synchronously but asserted asynchronously. The deassertion of Reset must be synchronized by the 







If the mode input is sampled as a'1' at the rising edge of reset, the connection between FpInt andthe CPU interrrupt 
bus will be established via the interrept reset vector. Vcc should be used to establish this option. 

lf mode is sampled as a 0 at the rising edge of reset, the Fp|nt willbe an output, and must be externally connected 
back to a CPU interrupt input pin. Ground should be used to select this option. 


lf the mode pin is sampled as a 0 at the rising edge of reset, this signal will be the FPInt output from the FPA core 
of the IDT79R3500, and must be externally connected back to a CPU interrupt pin. If the mode pin is sampled as 
a 1 at the rising edge of reset, this pin should be grounded. 3 
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RECOMMENDED OPERATING 
TEMPERATURE AND SUPPLY VOLTAGE 


| 












Temperature 





Military —55°C to +125°C - 5.04+10% 
20-25 MHZ (Case) . | 





Commercial 
20-33 MHz 


Commercial 
40 MHz 


0°C to +90°C 5.0 +5% 
(Case) 


2871 tbl 10 





0°C to +70°C OV 5.0 +5% 
(Ambient) 
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AC TEST CONDITIONS _ OUTPUT LOADING FOR AC TESTING ce | 


Sym bol Parameter 


IH Input HIGH Voitage 
IL 









Input LOW Voltage 


Input LOW Voltage ; 7 a 
2871 tbl 09 7 KO | | | 25pF 





To Device 
Under Test 
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DC ELECTRICAL CHARACTERISTICS 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V +5%) 


Symbol! 


































Test Conditions ) min. | Max. | 
[Vor [OuputHiGH Votage | Veo=Min,ow=—4ma | 35 | — | 
Fen [input Capactance® [SCY S| 
fice | Operating Gurrent ‘| Voo=sV.Ta=70 «| —— ‘| 550 | 
ric__[ mputLow Leakage [Vi-GnD—~—=dC too | — ‘| 





NOTES: 2874 tbl 11 
1. Vit Min. = —3.0V for pulse width less than 15ns. Vit should not fall below —0.5 Volts for larger periods. f 7 
2. ViHs and VILs apply to Clk2xSys, Clk2xSmp, Clk2xRd, Clk2xPhi, CpBusy, and Reset. 
3. These parameters do not apply to the clock inputs. .. 2 ; 3 a 
4. VoHtand VoLT apply to the bidirectional data and tag busses only. Note that ViH and Vi_ also apply to these signals. VOHT and VoLT are provided 
to give the designer further information about these specific signals. 
5. ViH should not be held above Vcc + 0.5 volts. 
6. Guaranteed by design. 
7. VoHc applies to RUN and Exception. 
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DC ELECTRICAL CHARACTERISTICS 
MILITARY TEMPERATURE RANGE (Tc = -55°C to +125°C, Vcc = +5.0V +10%) 


79R3500 
Symbol Parameter 
V 
V 





Vcc = Min., oH = -4mA 
Vcc = Min., lol = 4mA 
Vcc = Min., loH = -4mA 
Vcc = Min., [oH = -8mA 
Vcc = Min., lo. = 8mA 
(2, 
2 
N 


Vo 
Vo 
| 
| 


Output Capacitance(®) 


NOTES: 2871 tbl 12 
1. Vit Min. = —3.0V for pulse width less than 15ns. Vit should not fall below —0.5 Volts for larger periods. 
2. ViHs and ViLs apply to Clk2xSys, CIk2xSmp, Clk2xRd, Cik2xPhi, CpBusy, and Reset. . 
3. These parameters do not apply to the clock inputs. 
4. Vout and VoLT apply to the bidirectional data and tag busses only. Note that ViH and Vi_ also apply to these signals. VoHT and VoLT are provided 
to give the designer further information about these specific signals. 
5. Vit should not be held above Vcc + 0.5 volts. 
6. Tested only initially, and after design changes which may affect capacitance. 
7. VoHc applies to RUN and Exception. 
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DC ELECTRICAL CHARACTERISTICS ee a 
COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc =+5.0V+5%) = 


79R3500 
Symbol 


Output HIGH Voltage Vcc = Min., loH = 4mA | 
Output HIGH Voltage! Vcc = Min., oH = -4mA 





































_ Parameter Test Conditions 







—_ 
= 
= 


VOH 
VOL 
7) | 

[Wor [oupuLow votage®™7 [Voo=Min, r= ema 
Ce 
CIN 6 
ioc = 
IIH 3) 
IL 
loz 


a ee 
a 
Vins | InputHIGH Voltage") | 
1 
pCw [input Capacitance | 
[Cour | OutputCapacitance®) | 
( 
(3) 


Fico [Operating Curent | Voo=8V,TA=700 | 
[ic [input ow Leakage? [vusGno |e 
[ice [Output Tistate Leakage [Von= vo vor=end [100 —SO«dS 0 


NOTES: 2871 tbl 13 
1. Vit Min. = —3.0V for pulse width less than 15ns. Vi. should not fall below —0.5 Volts for larger periods. 
2. Vis and ViLs apply to Clk2xSys, Clk2xSmp, Clk2xRd, Cik2xPhi, CpBusy, and Reset. 
3. These parameters do not apply to the clock inputs. eae i : . i 
4. VoHTand VoLT apply to the bidirectional data and tag busses only. Note that ViH and Vit also apply to these signals. VoHT and VoLT are provided 
to give the designer further information about these specific signals. 
Vin should not be held above Vcc + 0.5 volts. 
Guaranteed by design. 
Vonc applies to RUN and Exception. 








NOM 
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AC ELECTRICAL CHARACTERISTICS(1:2.3) 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Voc = +5.0V +5%) 


79R3500 
[| 25.0MHz | —33.33MHz__—i| 














Clock 
el a 
| 8.0 
TckP | Input Clock Period 20 
Clk2xSys to Clk2xSmp\°) 0 
Clk2xSmp to Clk2xRad(°) 0 
Clk2xSmp to Clk2xPhil®) 5.0 





Run Operation 


[Data Vaid _————~*d Load apr | dao 
| —2 
erie) 





| 
ma thes 
ro) 


a 
: 
| 
oO 
: | 
on 


DVal | Data Valid 


i 
co) 
=) 
a 


N 
oO 
: 
p=) 
” 


(3) 
(3) 
TwrDly | Write Delay Load= 25pF 
Tos_[Data Setup 
DH | Data Hold‘) 
:0) 
(2) 


> 
oO 
= 
nD 


S 
” 


i 


+ 


ie) 
= 


usneaE 
fe ee ee! 
iene 
wr | Memory Wite [Load B5pF [| 23 


NO 
ro) 
| 
Nh 
on 


xc Load= 25pF 
val | Address Valid Load= 25pF 
Tints | Int(n) Set-up 


intH | Int(n) Hold 


tall Operation 


Tsaval | Address Valid 
TSAcTy 
Turd 1.0 
Tron | Run initiate ——=—=S~S~*diCa= 
SMW 3.0 
Tsexc 
Reset Initialization 

TRST | Reset Pulse Width Tceyc 
Tprwon! Power on (4) 


[3000_| 8000 | — | 3000 | — | Toye 
Capacitive Load Deration 
SS OO 


NOTES: 2871 tbl 14 
. All timings are referenced to 1.5V. 

The clock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Cik2xPhi. 

This parameter is guaranteed by design. 

Tcyc is one CPU clock cycle (two cycles of a 2x clock). 

With the exception of the Run signal, no two signals on a given device will derate for a given load by a difference greater than 15%. 

Clock transition time < 2.5ns for 33.33MHz; clock transition time < 5ns tor other speeds. 
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AC ELECTRICAL CHARACTERISTICS(1:2:3) | | 
MILITARY TEMPERATURE RANGE (Tc = -55°C to +125°C, Voc = +5.0V +10%) | 
. 79R3500 


Symbol __ Parameter Test Conditions | so Min. =] Max. 


Clock | 
Input Clock HIGH® [Notes = ss | to | CO Slt Of CU ots 
Tcktow | Input Clock LOW) Sa | = {ns | 


| Tektow 
TckP | Input Clock Period 

Clk2xSys to Clk2xSmp() 
Clk2xSmp to Clk2xRd(5) 
| Clk2xSmp to Clk2xPhi(°) 

ee eee ee 

ee 

|Load=25pF_ 


































Run Operation 


ros [Data Setup 
|ToH [DataHold) tC C*s‘“‘sisC 
eigemeae = ho 
rear [opsusyHad | 








Int(n) Set-up ae ee 
Int(n) Hold eee Seo 








Stall Operation 


Address Valid Load= 25pF 










, 
} 


Reset Initialization 


[trast [ResetPusewith | ToT OT oT tre | 
[Tewonf Poweron( T0000 S| — 000 Sf = Toye 


Capacitive Load Deration 


Load Derate(® a ee 


OTES: 2871 tbl 1 
. All timings are referenced to 1.5V. 

The clock parameters apply to all four 2xClocks: Cik2xSys, Cik2xSmp, Clk2xRd, and Cik2xPhi. 

This parameter is guaranteed by design. . 

Teye is one CPU clock cycle (two cycles of a 2x clock). 

With the exception of the Run signal, no two signals on a given device will derate for a given load by a difference greater than 15%. 

Clock transition time < 2.5ns for 33.33MHz; clock transition time < 5ns for other speeds. 


i = e 
Load= 25pF 
Memory Read Initiate Load= 25pF 
Memory Read Terminate Load= 25pF 





OnrwN 
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AC ELECTRICAL CHARACTERISTICS(.2,3) 
COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc = +5.0V +5%) 





79R3500 











Symbol Parameter Test Conditions 


TckHigh| Input Clock HIGH(2) — Note 6 
Toxtow) Input Clock LOW® Note 6 


Input Clock Period 12.5 


2) 
2) 
re) 
ro 











O 
x 
U 
=: 
O 





Clk2xSys to Clk2xSmp") 0 
Clk2xSmp to Clk2xRd(°) 0 
Phi(5) 3.0 









Run Operation 


Data Enable(®) 
Topis | Data Disable‘) ene ee 
(1: 


— 
ol 
a) 
i?) 


Toval_| Data Valid Load 25pF oo 
Twroy | Write Delay Loads 25pF ee 






N 
fo) 
5 
a 


a) 
” 


Data Set-up 


oO 


QO 
n 


CpBusy Set-up eee ee as eee eee 
Tost | CpBusy Hol eee ee ee 
TAcTy | Access Type (1:0) Load= 25pF 


a) 
” 


Ree 
wwe [Memory Write _—=i( Load= opr | 
Ex Load= 25pF 
Taval oc ees (ian eacane aeaec 
Tits_[Int{nySet-up ae ere 
it | IMEC CNG ee es 
Deer ee a 
(a ec een eee eee ere (ee (ee 
Stall Operation 
saval|Address Valid | Load= 25pF_— | 
sacty| Access Type | Load=25pF_— | 
med [Memory Readinitiate | Load= 25pF_ | 
- a eee 


Ei 
= 
” 


CBS 
r 
c 


& 
ro) 
5 
n 


H__| Data Hold’) De | =2.5 





FF 


[> ) io) 
o1 lo) 
mo) = 












=) 
” 


3}; 3 
nmin 


Set Ped 
To 
ion 
Cc 
n 
<= 


12.5 


Sa 


Qe 
al feo 6 
=a 


3S | 5 
“nln 


Run Load= 25pF Sees 
SMW 
Tsexe [Exception Valid [Load=25pF_ | 
Reset Initialization 
TpwrOn | Power On(4) Re ae ee 

Capacitive Load Deration 


Load Derate(®) ps2 pF 


NOTES: 2871 tbl 16 
. All timings are referenced to 1.5V. 
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1 

2. The clock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. Tcyc is one CPU clock cycle (two cycles of a 2x clock). 

5. With the exception of the Run signal, no two signals on a given device will derate for a given load by a difference greater than 15%. 
6. Clock transition time < 2.5ns. 
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Phase : an 2 | 2 1 
ee : Tsys 
an 

— FpPhiOut 
Font a, 
Tfpint Tfpint 
2871 drw 25 
Figure 13. Floating Point Interrupt 

Phase , 

FpSysOut 





—— TStallS | 
mama =D TStallH 


TRExS == TEXH 
Run 


Figure 14. Floating Point Busy 2871 drw 26 | 
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Tcklow Tckp Tekhigh 


Clk2xSys 
Tsmp 


Cik2xSmd 


\ 


Clk2xRd 


Cik2xPhi 
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Figure 15. Input Clock Timing 


Tcyo 
SysOut 


SmpOut" 


RdOut* 


PhiOut* 
Tsys 


2871 drw 20 
Figure 16. Processor Reference Clock Timing 


“These signals are not actually output from the processor. 
They are drawn to provide a reference for other timing diagrams. 
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Phase : | 2 1 2 . 7 i 23 






il 
il 


AddrLo D Addr K | Addr K D Addr K Addr 
AccTyp 0:1 = Size of Stored Data | | Size of Load Data | 
| Bus 
Input 
Tdh 
Data and 
Tag Buses 


Z| dale 


IClk 


DClk 





DWr 





Twrdly 


2871 drw 21 


Figure 17. Synchronous Memory (Cache) Timing 
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RUN STALL STALL FIXUP RUN 


Phase 1 2 1 2 1 2 1 2 1 
SysOut Tsys Tsys 
addi vee ns i pest Noel 
en 
“ ei oi 
High) Trd 
Tacty Tacty 
rectpos KX Daas») Ko tease > 


ata Size 
mgel TT Tp 
ee cs 
=e Se = 
Data i N ee. 


(Output) fo $ 
Tden Trd Tden 
mi 
coeaune Tmwr 
MemWr on 
| "ie <=) Tsmp 
WrBusy LT, 
a Trun 
Run 
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AccTyp 2 








O 
= 


Figure 18. Memory Write Timing 
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Phase. . 


SysOut 
— PhiOut 
AddrLo 


Tag 
(Address 
High) 


AccTyp 0:1 


AccTyp 2 


Data 
(Input) 


MemRd 


RdBusy 


a 
© 
3 


CpCondo 


Run 


- s 5 T 
Read Address | , es ¢ D Addr x | Addr» 
Read Address 








Figure 19. Memory Read Timing 
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Coprocessor Store Coprocessor Load 




















Phase 1 
nee | ys 
PhiOut | 
| Bus D Bus 
Input Input 
Tdh Tdh 
Data Bus ay 2D <p 
Trun Tstl 
= Tk LZ 
a ee 
a  ,  S 
eens 
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nae ee 
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onenen 
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Figure 20. Coprocessor Load/Store Timing 
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Phase 1 2 1 2 . 1 
eTeT" Tsys Tsys 
7 | Tsmp aaa aaa 
oe TintH as a TintH 
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Figure 21. Interrupt Timing 
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Figure 22. Mode Vector Initialization 


NOTES: 

1. Reset must be negated synchronously; however, it should be asserted asynchronously. Designs must not rely on the proper functioning of SysOut prior 
to the assertion of Reset. 

2. Reset is actually sampled in both Phase 1 and Phase 2. To insure proper initialization, it must be negated relative to the end of Phase 1. 
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ORDERING INFORMATION 





IDT XXXXX — XX X X 
Device Type Speed Package Process/ 
Temperature 
Range 








Blank Commercial (0°C to +70°C) 
B Military (-55°C to +125°C) 
Compliant to MIL-STD-883, Class B 
M Military Temperature Range Only 
G 161-Pin PGA (Cavity Down) 
MD 160-Pin MQUAD 
F 172-pin Flatpack (Cavity Up) 
16 16.67 MHz 
20 20.0 MHz 
25 25.0 MHz 
33 33.33 MHz 
40 40.0 MHz 
79R3500 RISCore CPU Processor 
2871 drw 27 
VALID COMBINATIONS 
IDT 79R3500 - 20,25,33,40 G, MD 
79R3500 - 20,25,-B,M G).F 
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| | iwT79R3041™ 






Integrated Device Technology, Inc. 






INTEGRATED RISController™ 
FOR LOW-COST SYSTEMS 


ADVANCE 
INFORMATION | 
IDT79R3041 









FEATURES: : 
¢ Instruction set compatible with |IDT79R3000A and R3051™ 
FamilyRISCCPUs ~ © = a 
¢ High level of integration minimizes system cost 
— RISCCPU). ae, a 
— Multiply/divide unit 
— Instruction Cache’ — 
— DataCache ~~ ... 
— Programmable bus interface 
— Programmable port width support 
¢ On-chip instruction and data caches 
— 2kB of Instruction Cache 
— 512B of Data Cache 
¢ Flexible bus interface allows simple, low cost designs 
— Superset pin-compatible with R3051 
— Adds programmable port width interface 
(8-, 16-, and 32-bit memory sub-regions) 
— Adds programmable bus interface timing support 
(Extended address hold, Bus turn around time, 
Read/write masks) 


e e e e 


e 


Single, double-frequency clock input 

16 and 20MHz operation 

14 MIPS at 20MHz 

Low cost 84-pin PLCC packaging 

On-chip 4-deep write buffer eliminates memory write stalls 
On-chip 4-word read buffer supports burst or simple block 
reads | 

On-chip DMA arbiter 

On-chip 24-bit timer , 

Boot from 8-bit, 16-bit, or 32-bit wide boot PROMs 

Pin- and software-compatible family includes R3041™, 
R3051™, R3052™, and R3081™ 

Complete software support 

— Optimizing compilers 

— Real-time operating systems 

— Monitors/debuggers a 

— Floating Point Software 

— Page Description Languages 
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4-deep | 4-dee 
Write 
Buffer 


| Buffer 


Address/ 
Data 












Exception/Control 
egisters 
Bus Interface 

Registers 

PortSize 
Register 
Counter 

Registers 


| Virtual Address 


DMA Rd/Wr_ SysCik 
Ctrl 
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Figure 1. R3041 Block Diagram 
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INTRODUCTION 


The IDT R3051 family is a series of high-performance 32- 
bit microprocessors featuring a high-level of integration, and 
targeted to high-performance but cost sensitive embedded 
processing applications. The R3051 family is designed to 
bring the high-performance inherent in the MIPS RISC archi- 
tecture into low-cost, simplified, power sensitive applications. 

Thus, functional units have been integrated onto the CPU 
core in order to reduce the total system cost, rather than to 
increase the inherent performance of the integer engine. 
Nevertheless, the R3051 family is able to offer 35 MIPS of 
integer performance at 40 MHz without requiring external 
SRAM or caches. 

Further, the R3051 family brings dramatic power reduction 
to these embedded applications, allowing the use of low-cost 
packaging. Thus, the R3051 family allows customer applica- 
tions to bring maximum performance at minimum cost. 

The R3041 extends the range of price/performance achiev- 
able with the R3051 family, by dramatically lowering the cost 






R308 1 16kB 4kB 
or 8kB or 8KB 





Name Cache Cache Point Options 
R3051 | 4kB 2kB Software Emulation Mux’ed A/D — 


On-chip Hardware 


R3041 5128 Software Emulation 


of using the MIPS architecture. The R3041 has been de- 
signed to achieve minimal system and components cost, yet 
maintain the high-performance inherent in the MIPS architec- 
ture. The R3041 also maintains pin and software compatibility 
with the R3051 and R3081. 

The R3051 family offers a variety of price/performance 
features in a pin-compatible, software compatible family. 
Table 1 provides an overview of the current members of the 
R3051 family. Note that the R3051, R3052, and R3081 are 
also available in pin-compatible versions that include a full- 
function memory management unit, including 64-entry TLB. 
The R3051/2 and R3081 are described in separate manuals 
and data sheets. 

Figure 1 shows a block level representation of the func- 
tional units within the R3041. The R3041 canbe viewed as the 
embodiment of a discrete solution built around the R3000A. 
By integrating this functionality on a single chip, dramatic cost 
and power reductions are achieved. 

An overview of these blocks is presented here, followed 
with detailed information on each block. 








1/2 frequency bus option 









8-, 16-, and 32-bit port widths support 
Programmable timing support 
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Table 1. Pin compatible R3051 Family 


CPU Core 


The CPU coreisa full 32-bit RISC integer execution engine, 
capable of sustaining close to single cycle execution rate. The 
CPUcore contains a five stage pipeline, and 32 orthogonal 32- 
bitregisters. The R3051 family implements the MIPS-IISA. In 
fact, the execution engine of the R3041 is the same as the 
execution engine of the R3000A. Thus, the R3041 is binary 
compatible with those CPU engines, as well as compatible 
with other members of the R3051 family. 


#1 





l#2 


l#3 


l#4 


l#5 


Cie Ro [ALU [We Ws 
CF [RS [ALU [we we 
CF [Rb | ALU eM we 


The execution engine of the R3051 family uses a five-stage 
pipeline to achieve close to single cycle execution. A new 
instruction can be started in every clock cycle; the execution 
engine actually processes five instructions concurrently (in 
various pipeline stages). Figure 2 shows the concurrency 
achieved by the R3051 family pipeline. 











iF | RD [ALU [MEM] We | 





Current 
CPU 
Cycle 


Figure 2. R3051 Family 5-Stage Pipeline 
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System Control.Co-Processor . 


The R3041 also integrates on-chip a emigat Control Co- 
processor, CPO. CPO manages the exception handling capa- 
bility of the R3041, the virtual to physical address mapping of 
the R3041, and the programmable bus interface capabilities 
of the R3041: These sOpICS are arene. in | subsequent 
sections. 

The R3041 does not include the aotionat TLB fouiid’ in other 
members of the R3051 family, but instead performs the same 
virtual to physical address mapping of the base versions of the 
R3051 family. These devices still support distinct kernel and 
user mode operation, but do not require page management 
software or an on-chip TLB, leading to a Sine software 
model and a lower-cost processor. 

The memory mapping used by these devices is illustrated 
in figure 3. Note that the reserved address spaces shown are 
for compatibility with future family members; in the current 
family members, references to these addresses are trans- 
lated in the same fashion as their respective segments, with 
no traps or exceptions taken. 

When using the base versions of the architecture, the 
system designer can implement a distinction between the 
user tasks and the kernel tasks, without having to execute 


page management software. This distinction can take the. 


dress decoding, or in other forms. In systems which do not 
wish to implement memory protection, and wish to have the 
kernel and user tasks operate out of a single unified memory 
space, upper address lines can be ignored by the address 
decoder, and thus all references will be seen in the lower 
gigabyte of the physical address space. 

The R3041 adds additional resources into the on- chip CPO. 
These resources are detailed in the R3041 User's Manual. 
They allow kernel: software. to directly control activity of the 
processor internal resources and bus interface, and include: 
¢ Cache Configuration Register: This register controls the 
data Cache miss refill algorithm used, and also controls 

some of the debug features of the R3041. 

Bus Control Register. This register controls the behavior 

of various bus interface signals. _ 

Count and Compare Registers. Together, these registers 

implement a programmable 24-bit timer, which canbe used 

for DRAM refresh or as a general purpose timer. 

PortSize Control Register. This register allows the kernel 

to indicate the port width of reads and writes to various sub- 

regions of the physical address space. Thus, the R3041 can 

interface directly with 8-, 16-, and 32-bit memory ports, 

including a mix of sizes, for both instruction and data 
_references, without requiring external logic. 


e 


e 


e 


form of physical memory pene: accomplished by ad- 7 
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Figure 3. Virtual to ee Mapping of Base Architecture Versions 


Clock Generation Unit 3 
The R3041 is driven from a single input clock, capable of 


operating in a range of 40%-60% duty cycle. On-chip, the 
clock generator unit is responsible for managing the interac-: 


tion of the CPU core, caches, and bus interface. The clock 
generator unit replaces the external delay line required in 
R3000A based applications. 


| ‘Instruction Cache 


The R3041 integrates 2kB of on-chip Instruction Cache, 
organized with a line size of 16 bytes (four 32-bit entries). This 
relatively large cache substantially contributes to the perfor- 


mance inherent in the R3041, and allows systems based on 


the R3041 to achieve high-performance even from low-cost 
memory systems. The cache is implemented as a direct 
mapped cache, and is capable of caching instructions from 
anywhere within the 4GB physical address space. The cache 


_ {§ implemented using physical addresses and physical tags 


(rather than virtual addresses or tags), and thus does not 
require flushing on context switch. 
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Data Cache 


The R3041 incorporates an on-chip data cache of 512B, 
organized as a line size of 4 bytes (one word). This relatively 
large data cache contributes substantially to the performance 
inherentin the R3051 family. As with the instruction cache, the 
data cache is implemented as a direct mapped physical 
address cache. The cache is capable of mapping any word 
within the 4GB physical address space. 

The data cache is implemented as a write through cache, 
to insure that main memory is always consistent with the 
internal cache. In order to minimize processor stalls due to 
data write operations, the bus interface unit incorporates a 4- 
deep write buffer which captures address and data at the 
processor execution rate, allowing it to be retired to main 
memory at a much slower rate without impacting system 
performance. 


Bus Interface Unit 


The R3051 family uses its large internal caches to provide 
the majority of the bandwidth requirements of the execution 
engine, and thus can utilize a simple bus interface connected 
to slow memory devices. 

The R3051 family bus interface utilizes a 32-bit address 
and data bus multiplexed onto a single set of pins. The bus 
interface unit also provides an ALE (Address Latch Enable) 
output signal to de-multiplex the A/D bus, and simple hand- 
shake signals to process CPU read and write requests. In 
addition to the read and write interface, the R3041 incorpo- 
rates a DMA arbiter, to allow an external master to contro! the 
external bus. 

The R3041 augments the basic R3051 bus interface capa- 
bility by adding the ability to directly interface with varying 
memory port widths, for instructions or data. Thus, the R3041 
can be used in a system with an 8-bit boot PROM, 16-bit font 
cartridges, and 32-bit page buffer, transparently to software, 
and without requiring external data packing, rotation, and 
unpacking. 

In addition, the R3041 incorporates the ability to change 
some of the interface timing of the bus. These features can be 


used to eliminate external data buffers, and take advantage of 
lower speed (lower cost) interface components. 

One of the bus interface options is the Extended Address 
Hold mode which adds 1/2 clock of extra address hold time 
from ALE falling. This allows easier interfacing to FPGAs and 
ASICs. | 

The R3041 incorporates a 4-deep write buffer to decouple 
the speed of the execution engine from the speed of the 
memory system. The write buffers capture and FIFO proces- 
sor address and data information in store operations, and 
present it to the bus interface as write transactions at the rate 
the memory system can accommodate. During main memory 
writes, the R3041 can break a large datum (e.g. 32-bit word) 
into a series of smaller transactions (e.g. bytes), according to 
the width of the memory port being written. This operation is 
transparent to the software which initiated the store, insuring 
that the same software can run in true 32-bit memory systems. 

The R3051 family read interface performs both single word 
reads and quad word reads. Single word reads work with a 
simple handshake, and quad word reads can either utilize the 
simple handshake (in lower performance, simple systems) or 
utilize a tighter timing mode when the memory system can 
burst data at the processor clock rate. Thus, the system 
designer can choose to utilize page or nibble mode DRAMs 
(and possibly use interleaving, if desired, in high-performance 
systems), or use simpler techniques to reduce complexity. 

In order to accommodate slower quad word reads, the 
R3051 family incorporates a 4-deep read buffer FIFO, so that 
the external interface can queue up data within the processor 


before releasing it to perform a burst fill of the internal caches. 


5.4 


In addition, the R3041 can perform on-chip data packing 
when performing large datum reads (e.g., quad words) from 
narrower memory systems (e.g., 16-bits). Once again, this 
operation is transparent to the actual software, simplifying 
migration of software to higher performance (true 32-bit) 
systems, and simplifying field upgrades to wider memory. 
Since this capability works for either instruction or data reads, 
using 8-, 16-, or 32-bit boot PROMs is easily supported by the 
R3041. 
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SYSTEM USAGE 


The IDT R3051 family has been specifically. designed to 
easily connect to low-cost memory systems. Typical low-cost 
memory systems utilize slow EPROMs, DRAMs, and /applica- 
tion specific peripherals. 

Figure 4 shows some of the flexibility inherent in the R3041. 
In this example system, which is typical ofa laser printer, a 32- 
bit PROM interface is used due to the size of the PDL 
interpreter. An embedded system can use an 8-bit boot 
PROM instead. A 16-bit font cartridge interface is provided for 
add-in cards and a 16-bit page buffer is used for low cost. In 


this system, a field or manufacturing upgrade to a 32-bit page 
buffer is supported by the boot software and DRAM controller. 
Such a system features a very low entry price, with a range of 
field upgrade options including the ability to upgrade toamore 
powerful member of the R3051 family. 

The IDT R3051 family has been specifically designed to 
easily connect to low-cost memory systems. Typical low-cost 
memory systems utilize slow EPROMs, DRAMs, and applica- 
tion specific peripherals. Embedded systems may typically 
substitute static RAMs for the DRAMs. 
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Figure 4. Typical R3041-Based Application 


5.4 
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DEVELOPMENT SUPPORT 


The IDT R3051 family is supported by a rich set of develop- 
ment tools, ranging from system simulation tools through 
PROM monitor and debug support, applications software and 
utility libraries, logic analysis tools, and sub-system modules. 
Figure 5is an overview of the system development process 
typically used when developing R3041 applications. The 
R3051 family is supported in all phases of project develop- 
ment. These tools allow timely, parallel development of 
hardware and software for R3051 family based applications, 
and include tools such as: 
¢ A program, Cache-3041, which allows the performance of 
an R3041 based system to be modeled and understood 
without requiring actual hardware. 

¢ Sable, an instruction set simulator. 

¢ Optimizing compilers from MIPS Technology, the acknowl- 
edged leader in optimizing compiler technology. 


e 


e 


e 


e 


Cross development tools, available in a variety of develop- 
ment environments. 

The high-performance IDT floating point emulation library 
software. 

The IDT Evaluation Board, which includes RAM, EPROM, 
/O, and the IDT PROM Monitor. | 

The IDT Laser Printer System board, which directly drives 
a low-cost print engine, and runs Microsoft Truelmage™ 
Page Description Language on top of PeerlessPage™ 
Advanced Printer Controller BIOS. 

Adobe PostScript™ Page Description Language running 
on the IDT R3051 family. 

The IDT/sim PROM Monitor, which implements a full PROM 
monitor (diagnostics, remote debug support, peek/poke, 
etc.). | 

IDT/kit (Kernel Integration Toolkit), providing library support 
and a frame work for the system run time environment. 


System System System 
Architecture Development Integration 
Evaluation Phase and Verfification 


SABLE Simulator 
DBG Debugger 
PIXIE Profiler 
MIPS Compiler Suite 
Stand-Alone Libraries 
Floating Point Library 





Cache-3041 
Benchmarks | 
Evaluation Board 
Laser Printer System 


Cross Development Tools 
Adobe PostScript PDL _f! 
MicroSoft Truelmage PDL f 
PeerlessPage BIOS 3 
T/kit 


e-3041 


Cac 
Hardware Models 
General CAD Tools 
RISC Sub-systems 
Evaluation Board 
Laser Printer System 





Figure 5. R3041 Development Environment 
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PERFORMANCE OVERVIEW 


The R3051 family achieves a very ‘high- -level of perfor- 
mance. This performance is based on: _ 
An efficient execution engine. The CPU performs ALU 
operations and store operations in a single cycle, and has 
an effective load time of 1.3 cycles, and branch execution 
rate of 1.5 cycles (based on the ability of the compilers to 
avoid software interlocks). Thus, the R3041 achieves over 
16 MIPS performance when operating out of cache. 
Large on-chip caches. The R3051 family contains caches 
which are substantially larger than those on the majority of 
embedded microprocessors. These large caches minimize 
the number of bus transactions required, and allow the 
R3051 family to achieve actual sustained performance very 
close to its peak execution rate, even with low cost memory 
systems. | | 
Autonomous multiply and divide operations. The R3051 
family features an on-chip integer multiplier/divide unit 
which is separate from the other ALU. This allows the 
R3041 to perform multiply or divide operations in parallel 
with other integer operations, using a single multiply or 
divide instruction rather than using “step” operations. 
Integrated write buffer. The R3041 features a four deep 
write buffer, which captures store target addresses and data 
at the processor execution rate and retires it to main 
memory at the slower main memory access rate. Use of on- 
chip write buffers eliminates the need for the processor to 
Stall when performing store operations. 
Burst read support. The R3041 enables the system de- 


° 


e 


signer to utilize page mode, static column, or nibble mode. 
RAMs when performing read operations to minimize the — 


main memory read penalty and increase the effective cache 
hit rates. 


The performance differences among the various R3051 
family members depends on the application software and the 
design of the memory system. Different family members 
feature different cache sizes, and the R3081 features a 
hardware floating point accelerator. Since all these devices 


can be used in a pin and software compatible fashion, the — 


system designer has maximum freedom in trading between 
performance and cost. The memory simulation tools (e.g. 
Cache-3041) allows the system designers to analyze and 
understand the performance differences among these de- 
vices in their application. 


5.4 


SELECTABLE FEATURES - 


The R3051 family allows the system designer to sentiauie 
some aspects of operation. 

Some of these configuration options are under the software 
control of the kernel, and are contained in the System Control 
Co-Processor registers. Others are established via. mode 
inputs sampled at the negating edge of device reset. 
Selectable features include: 
¢ BigEndian vs. LittleEndian operation: The part can be 
configured to operate with either byte ordering convention, 
and in fact may also be dynamically switched between the 
two conventions.. This facilitates the porting of applications 
from other processor architectures, and also permits inter- 
communication between various types of pipes esoi> and 
databases. 

Data cache refill of one or four words: The memory 
system must be capable of performing 4 word transfers.to 
satisfy instruction cache misses and 1 word transfers to 
satisfy uncached references. The data cache refill size 
option allows the system designers to choose between one 
and four word refill on data cache misses, depending on the 
performance each option brings to their application. 
Bus Turn Around Speed: The R3041 allows the kernel to 
increase the amount of time between bus transactions 
when changes in bus direction occur (e.g. the end of reads). 
This allows transceivers and buffers to be eliminated from 
the system. _ | 
Extended Address Hold Time: The R3041 allows the 
_ system designer to increase the amountofholdtime available 
for address latching, thus allowing slower speed (low cost) 
address latches, FPGAs and ASICs to be used. 
Programmable control signals: The R3041 allows the 
system designer to optimally configure various memory 
control signals. to simplify external logic, thus reducing 
system cost. | 
Variable Memory Port Widths: The R3041 allows the © 
kernel to partition the physical memory space into various 
_ gub-regions, and to individually indicate the port width of 
these sub-regions. Thus, the bus interface unit can perform 
data packing and unpacking when communicating with 
narrow memory sub-regions. For example, these features, 
can be used to allow the R3041 to interface with narrow boot 
PROMS, or to implement 16-bit only memory systems. 
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THERMAL CONSIDERATIONS 


The R3051 family utilizes special packaging techniques to 
improve the thermal properties of high-speed processors. 
Thus, all versions of the R3051 family are packaged in cavity 
down packaging. 

The lowest cost members of the family use a standard 
cavity down, injection molded PLCC package (the “J” pack- 
age). This package is used for all speeds of the R3041 family. 

Higher speed and higher performance members of the 
R3051 family utilize more advanced packaging techniques to 
dissipate power while remaining both low-cost and pin- and 
socket- compatible with the PLCC package. Thus, these 
members of the R3051 family are available in the MQUAD 
package (the “MJ” package), which is an all aluminum pack- 
age with the die attached to a normal copper lead-frame 
mounted to the aluminum casing. The MQUAD package is 
pin and form compatible with the PLCC package. Thus, 
designers can choose to utilize this package without changing 
their PCB. 


The members of the R3051 family are guaranteed ina case 
temperature range of 0°C to +95°C. The type of package, 
speed (power) of the device, and airflow conditions, affect the 
equivalent ambient conditions which meet this specification. 

The equivalent allowable ambient temperature, TA, can be 
calculated using the thermal resistance from case to ambient 
(OcA) of the given package. The following equation relates 
ambient and case temperature: 

TA=TC-P*@CA 

where P is the maximum power consumption at hot tem- 
perature, calculated by using the maximum Icc specification 
for the device. 

Typical values for OCA at various airflows are shown in 
Table 2 for the PLCC package. 









a 





; 2905 tbl 02 
Table 2. Thermal Resistance (@ca) at Various Airflows 
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PIN DESCHIE HON 


PINNAME | vo |. DESCRIPTION 


A/D(31:0) = | | Address/Data: ‘A 32-bit time multiplexed bus which indicates the desired address for a ‘bus transaction 
i epee in one phase, and whichis used to transmit data between the CPU and external memory resources during 
” .the'rest of the transfer. 


Bus transactions on this bus are logically Spaiaieal into two. shoes: during the first phase, information 
about the transfer is presented to the memory system to be captured using the ALE output. This 
information consists of: 


Address(31:4): us The Kile: sarees for the transfer is re on.A/D(31 4). 2 


BE(3:0): __ These strobes indicate which bytes of the 32-bit bus will be involved in: 
: -. the transfer, and are presented on A/D(3:0). BE(3) indicates that — 
_ A/D(31:24) will be used, and BE(0) corresponds to A/D(7:0). These 
_ strobes: are only valid for accesses to 32-bit wide memory ports... Note 
that BE(3:0) can be held in-active during reads by setting the appropriate 
bit of CPO; thus, these signals could be en used as Write Enable. 
strobes. , , | 


During the second phase, these signals are the data bus for the transaction. 


Data(31:0): During write cycles, the bus contains the data to be stored and is driven 
from the internal write buffer. 
On read cycles, the bus receives the data from the external resource, in 
either a single data transaction or in a burst of four words, and places it 
into the on-chip read buffer. 


The byte lanes used during the transfer are a function of the datum size, 
the memory port width, and the system byte-ordering. 






Low Address (3:0) A 4-bit bus which indicates which word/halfword/byte is currently expected by the 
processor. For 32-bit port widths, only Addr(3:2) is valid during the transfer; for 16-bit port widths, only 
Addr(3:1) are valid; for 8-bit port widths, all of Addr(3:0) are valid. These address lines always contain 
the address of the current datum to be transferred. In writes and single datum reads, the addresses initially 
output the specific target address, and will increment if the size of the datum is wider than the target 
memory port. For quad word reads, these outputs function as a counter starting at '0000', and 
incrementing according to the width of the memory port. 


The R3041 Addr(1:0) output pins are designated as the Rsvd(1:0) pins in the R3051 and R3081. 














Diagnostic Pin. This output indicates whether the current bus read transaction is due to an on- 
chip cache miss and whether the read is an instruction or data, andis time multiplexed as described below: 






Cached: During the phase in which the A/D bus presents address information, this 
pin is an active high output which indicates whether or not the current 
read is a result of acache miss. The value of this pin at this time other 
than in read cycles is undefined. 








VD: A high at this time indicates an instruction reference, and a low indicates 
a data reference. The value of this pin at this time other than in read 
cycles is undefined. 


The R3041 Diag output pin is designated as Diag(1) in the R3051 and R3081. 


Address Latch Enable: Used to indicate that the A/D bus contains valid address information for 
the bus transaction. This signal is used by external logic to capture the address for the transfer, typically | 
using transparent latches. 























Data Enable: This signal indicates that the A/D bus is no longer being driven by the processor 
during read cycles, and thus the external memory system may enable the drivers of the memory 
system onto this bus without having a bus conflict occur. During write cycles, or when no bus 

transaction is occurring, this signal is negated, thus disabling the external memory drivers. 
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PIN DESCRIPTION (Continued): 


Burst Transfer/Write Near: On read transactions, the Burst signal indicates that the current bus read 
is requesting a block of four contiguous words from memory. This signal is asserted only in read cycles 
due to cache misses; it is asserted for all |-Cache miss read cycles, and for D-Cache miss read cycles 
if selected in the CPO Cache Config Register. 







On write transactions, the WrNear output tells the external memory system that the bus interface unit 
is performing back-to-back write transactions to an address within the same 256 byte page as the prior 
write transaction. This signal is useful in memory systems which employ page mode or static column 
DRAMs, and allows nearby writes to be retired quickly. 


Read: An output which indicates that the current bus transaction is a read. 
Write: An output which indicates that the current bus transaction is a write. 


Acknowledge: An input which indicates to the device that the memory system has sufficiently 
processed the bus transaction. On write transactions, this signal indicates that the CPU may either 
progress to the next data item (for mini-burst writes of wide datums to narrow memories), or terminate 
the write cycle. On read transactions, this signal indicates that the memory system has sufficiently 
processed the read, and that the processor core may begin processing the data from this read transfer. 















Read Buffer Clock Enable: An input which indicates to the device that the memory system has 
placed valid data on the A/D bus, and that the processor may move the data into the on-chip Read 
Buffer. 


System Reference Clock: An output from the CPU which reflects the timing of the internal 
processor "Sys" clock. This clock is used to control state transitions in the read buffer, write buffer, 
memory controller, and bus interface unit. 















DMA Arbiter Bus Request: An input to the device which requests that the CPU tri-state its bus 
interface signals so thatthey may be driven by an external master. The negation ofthis input relinquishes 
mastership back to the CPU. 














DMA Arbiter Bus Grant. An output from the CPU used to acknowledge that a BusReq has been 
detected, and that the bus is relinquished to the external master. 


The R3041 adds an additional DMA protocol, under the control of CPO. If the DMA Protocol is enabled, 
the R3041 can request that the external master relinquish bus mastership back to the processor by 
negating the BusGnt output early, and waiting for the BusReq input to be negated. 


ius) 
Cc 
G 
G) 
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SBrCond(3)/ | VO 


lOStrobe 


Branch Condition Port/IO Strobe: The use of this signal depends on the setting of various bits of the 
CPO Bus Control register. If BrCond mode is selected, this input is logically connected to CoCond(3), 
and can be used by the branch on co-processor condition instructions as an input port. The SBrCond(3) 
input has special internal logie to synchronize the input, and thus may be driven by asynchronous 
agents. 












If this pin is selected to function as IOStrobe, it may be asserted as an output on reads, writes, or both, 
as programmed into CPO. This strobe asserts in the second clock cycle of a ranstes, and thus can be 
used to strobe various control signals from the bus interface. 












SBrCond(2)/ /O 


ExtDataEn 


Branch Condition Port/Extended Data Enable: The use of this signal depends on the settings in the 
CPO Bus Control register. If BrCond mode is selected, this input is logically connected to CpCond(2), 
and can be used by the branch on co-processor condition instructions as an input port. The SBrCond(2) 
input has special internal logic to synchronize the input, and thus may be driven by asynchronous 
agents. 







If this pin is selected to function as Extended Data Enable, it may be asserted as an output on reads, 
writes, or both, as programmed into CPO. This strobe can be used as an extended data enable strobe, 
in that itis held asserted for one-half clock cycle after the negation of Rd or Wr. This signal may typically 
be used as a write enable control line for transceivers, as a write line for I/O, or as an address mux select 
for DRAMs. 


Memory Strobe: This active low output pulses low for each data read or written, as configured in the 
CPO Bus Control register. Thus, it can be used as a read strobe, write strobe, or both, for SRAM type 
memories or for I/O devices. 


In the R3051 and R3081, this pin is designated as the BrCond(0) input. 













MemStrobe 
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PIN DESCRIPTION (Continued): 
DESCRIPTION 


Byte Enable Strobes for 16-bit Memory Port: These active low outputs are the byte lane strobes for. 
.. accesses to 16-bit wide memory ports; they are not necessarily valid for 8- or 32-bit wide ports. If BE16(1) 
. is asserted, thenthe most significant byte (either D(31:24) or D(15:8), depending on system endianness) 
is going to be used in this transfer. If BE16(0) is asserted, the least significant byte (D(23:16) or D(7:0)). 
will be used. : 


BEI 6(1:0):can be held inactive (masked) during ead transfers, according to the programming of the CPO 
Bus Control register. . ; 


Inthe R3051 and R3081, these pins are designated as Rsvd(3: 2). 


_ Last Datum in Mini-Burst: This active low output indi icates that this is the last datum transfer in a given 
transaction. Itis asserted after the hem to az eeCen leads) or Ack (writes), and is negated \ when Rd- 
or Wr is negated. 


The Last output pin is designated i in the R3051 and R3081 as s the Diag(0) output pin. 


- Terminal Count: This is an active low output from the processor which indicates that the on-chip timer 
has reached its terminal count. It will remain low for either 1.5 clock cycles, or until software resets the 
timer, depending on the mode selected in the CPO Bus Control register. Thus, the on-chip timer can 
function either as a free running timer for system functions such as DRAM refresh, or can operate as a 
software controlled time-slice timer, or real-time clock. 


The TC output pin is designated in the R3051 as the BrCond(1) input, and inthe R3081 asthe Run output. 


Bus Error: Input to the bus interface unit to terminate a bus transaction due to an external bus error. 
This signal is only sampled during read and write operations. If the bus transaction is aread operation, 
then the CPU will take a bus error exception. 


Processor Interrupt: During normal operation, these signals are jogioally t the same as the Tnt(5: 0) 
signals of the R3000. During processor reset, these signals perform mode initialization of the CPU, but 
_in a different (simpler) fashion than the interrupt signals of the R3000. 





There are two types of interrupt inputs: the Sint inputs are internally synchronized by the processor, 
and may be driven by an asynchronous external agent. The direct interrupt inputs are not internally 
synchronized, and thus must be externally synchronized to the CPU. The direct interrupt inputs have 
one cycle lower latency than the synchronized interrupts. » 


Master Clock Input: This is a double frequency input used to control the timing of the CPU. 


‘Master Processor Reset: This signal i initializes the CPU. Reset initialization mode selection is . 
performed during the last cycle of Reset. 


Tri-State: This input to the:R3041 requests that the R3041 tri-state all of its outputs. In addition to those 
outputs tri-stated during DMA, tri-state will cause SysCik, TC, and BusGnt to tri-state. This signal is 
intended for use during board testing and emulation during debug and board manufacture. 


inthe R3051 and R3081, this input pin is designated:as Rsvd(4). 
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ABSOLUTE MAXIMUM RATINGS": 2) | 
[Symbol| Rating ~——_—| Commercial | Unit 


VTERM | Terminal Voltage with —0.5 to +7.0 V 
Respect to GND 
Operating Case Temperature 
TemperatureUnder Bias —55 to +125 


Storage Temperature —55 to +125 


NOTE: 2905 tbl 06 

1. Stresses greater than those listed under ABSOLUTE MAXIMUM RATINGS 
may cause permanent damage to the device. This is a stress rating only 
and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 
implied. Exposure to absolute maximum rating conditions for extended 
periods may affect reliability. 

2. VIN minimum = —3.0V for pulse width less than 15ns. 
Vin should not exceed Vcc +0.5 Volts. 

3. Notmore than one outputshould be shorted ata time. Duration of the short 
should not exceed 30 seconds. 







AC TEST CONDITIONS 


[symbol] Parameter | Min. | Max. 
vn | Tapa Hioh votage [30 | — 
es 
7S 







| 
ns [input HIGH votage [as [| — 
Tvs | inpattow vorage | — | oa |v — 
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RECOMMENDED OPERATING | 
TEMPERATURE AND SUPPLY VOLTAGE 


| Grade___ | Temperature 


~ Commercial O0°C to +95°C 
(Case) 






| GND | 
5.0+5% 
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OUTPUT LOADING FOR AC TESTING 


To Device 
Under Test 


ii CLD 
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[Signal] CC 
All Signals 25 pF 
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DC ELECTRICAL CHARACTERISTICS— (Tc = 


Symbol | Parameter 


Output HIGH Voltage Vec=Min, loH=-4mA Sy 
Output LOW Voltage — Vcc = Min., loL = 4mA | | — 0.4 


0°C to +95°C, Vcc = +5.0V +5%) 


| 16.67MHz | 20 MHz a 









| Test Conditions 





35 Se 





"Input LOW Voltage‘ 


VIHS 





OH 

VoL 

1H 
IL 8 
jVus | ImputLow vottage('2) PCO ES 

| i a re ae 

co! 
IH 
IL 
OZ ‘Leal 






V 
V 
V 
V 
J 
| 


/iiH | Input HIGH Leakage ViH = VCC 
Ale, Input LOW Leakage Vit = GND : 
Output Tri-state' Leakage 


NOTES: 
1. Vit Min. = —3.0V for pulse width less than 15ns. Vit should not fall below —0.5 volts for larger periods. 
2. ViHs and Vis apply to Clk2xIn and Reset. 

3. Vit should not be held above Vcc + 0.5 volts. 

4. Guaranteed by design. 


Output Capacitance(4) — 
Operating Current Vcc = 5V, TA= 70°C | | 


VOH = 2.4V, VoL = 0.5V 
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AC ELECTRICAL CHARACTERISTICS "1 2: 9) (Tc = 0°C to +95°C, Voc = +5.0V +5%) 


Symbol | Signals | Description. | Min, Max. 


BusReq, Ack, BusError, RdCEn Set-up to SysCik rising 
t2 BusReq, Ack, BusError, RdCEn Hold from SysCik rising 


A/D, Addr, Diag, ALE, Wr Tri-state from SysCik rising 
Burst(WrNear, Rd, DataEn 

A/D, Addr, Diag, ALE, Wr Driven from SysClk falling 
BurstWrNear, Rd, DataEn 


Asserted from SysClk rising 
Negated from SysCik falling 
Valid from SysCik rising 
Valid from SysClk rising 
Asserted from SysClk rising 
Negated from SysCik falling 
Hold from ALE negated 
Asserted from SysCik 
Asserted from A/D tri-state(*) 
Driven from SysCik rising(4) 
t15 Wr, Rd, DataEn, Burst/WrNear, Last, TC Negated from SysClk falling 


t16 Addr(3:2), BE 16(1:0) Valid from SysCik 


> 
” 


> Id > iD =] 
Om 10 nm 7n 


t2a 
t3 


a |e 
jet) 
n 


-e 


ae 
ta 
eo 
Pip. =| 
Ca 
ie 
ca 
Heo 
[oi 
pier 
Nil = i 
a 
hia 
ite 


Valid from SysClk 
Tri-state from SysClk 


© 
SE ie) 


OH; fa jo ta 
Oo Oo 3 75 |S 


SysClk to data out an 
20 Pulse Width High & e| £ 
t21 


22 


Pulse Width Low see 
Clock Period Su 
Pulse Width from Vcc valid Soe Re 


or 


~-- -- 
—h 


ct 
N 
w 


Minimum Pulse Width 
Set-up to SysCik falling 


Fal 
i) 
DB 


oO) 


Le) 
on 


t 
27 Mode hold from Reset rising 
128 Sint, SBrCond Set-up to SysClk falling 


- 


29 | Sint, SBrCond 
30___| Int, BrCond 
t31 nt, BrCond 
tsys 


=| 
ee) 


wo 3/5 [5 


; ; 
Mode set-up to Reset rising 


QD 
o 
Q 
x 





Q 
o 
O 
x 


(33 sClk 
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16.67 MHz 
Symbol | Description | Min. — Max. | 
iOStrobe Valid from SysCIkK falling - fe 
ExtDataEn, DataEn Asserted from SysClk rising . 3 
Asserted from SysCik rising . ers 
25pf(4. 5) 


MemStrobe Negated from SysClk falling 
Asserted from Addr(3:0) valid’) 


All outputs Timing deration for loading 
| over 25pf(4. 5 


NOTES: . . 2905 tbl 12 
1. All timings referenced to 1.5 Volts, with a rise and fall time of less than 2.5ns. 

2. All outputs tested with 25 pF loading. . 

3. The AC values listed here reference timing diagrams contained in the R3041 Hardware User's Manual. 

4 i i 

5 


145 
t47 
t48 

| 054 
t52 


AC ELECTRICAL CHARACTERISTICS (CONTINUED) 


tderate 





. Guaranteed by design. . 
. This parameter is used to derate the AC timings according to the loading of the system. This parameter provides a deration for loads over the specified 


test condition; that is, the deration factor is applied for each 25 pF over the specified test load condition. 
6. Timings t34 - t44 are reserved for other R3051 family members. 
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PIN CONFIGURATIONS 














SS A28 & S&aR8 8 geeetee2 
S28 ee ee ee eee ee ee ee 
MILO ELL A 
Vss Vss 
Vcc Vcc 
Clkin A/D(14) 
TriState A/D(13) 
BE16(1) A/D(12) 
BE16(0) A/D(11) 
Addr(1) A/D(10) 
Addr(0) A/D(9) 
int(5) Vcc 
Vss IDT R3041 Vss 




















SBrCond(3)/ lOStrobe 
SBrCond(2)/ ExtDataEn 
TC 





Vss 


< 

oO 

oO 
Pe Ee ES eee sey ae eta 
ar ee ee ete tel En EES etek ke ok Ee 3 


Vcc 


@ 
BAS 
N 
© 
oO 

















JUUUUUUUUUUU UU UU UU UU 
eE la © 2 a 10 a) << {e 
g B 
= 5 
a 
2905 drw 07 
84-Pin PLCC/ 
Top View 
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Clkin 


SysClk 
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Figure 8. R3051 Family Clocking 


Vcc 


Clkin } 
t23 


Reset 


Figure 9. Power-On Reset Sequence cla ar 
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Figure 10 (a). Warm Reset Sequence 


ClkIn 


Reset 





2905 dew 4 
Figure 10 (b). Warm Reset Sequence (Internal Pull-Ups Used) 


Reset 
Mode Vector Inputs: 
Sint(2:0), mie) be 


Mode Vector Inputs: | Le acsctmmees, | 
Adar(:0), BETE(T =) | x CSC~*=S 


External Device Drives Signals 





CPU Drives 


; ; 2905 drw 12 
Figure 11. Mode Selection and Negation of Reset 
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Address Turn Sample 
Memory Bus Data? 


SysClk 


t14 


a 
Q. 
ie 








sie es ee 
~ a 
2905 drw 13 
Figure 12(a). Timing with Non-Extended Address Hold Option 
Address Extend Sample 
Memory Address Data? 
SysClk 
Rd 
A/D(31:0) cK Ao —_ 
jn = Tf 
_ ae 
Diag Cached? I/D 
2905 drw 14 


Figure 12(b). Timing with Extended Address Hold Option 
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Run/ 
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Rd 
A/D(31:0) 
Addr(3:2) 
ALE 
DataEn 
ExtDataEn 
Burst 

Last 
MemStrobe 
1OStrobe 
Ack 


Diag 


Stall | Fixup | 
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ame 
{hed 
ae 
JST 


t14 
: .. Addr 
w BE fa Data Inpu eA 
or | Spake t16 


Word Address 





a st 


el | ELL 
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BN 
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- 
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t12 


ptt add it | 
Per tatalas tT at 
p | | te 


an ee 
att ge 


17 
Cached? /D 
Start Extended -Ack/ Ack/ Ack/ Sample End 
Read Address RdCEn RdCEn RdCEn Data Read 
: 2 ? 2905 drw 15 


Figure 13. Single Datum Read 
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Run/ 
Stall 


PhiClk afl 
sysClk * 


: Pre a erg a ery a 
VY Addr YE arco} i< B ed Byte 2 fia Byte 3 


Stall 








: ' yte 1 % yte : yt 
A/D(31:0) “I, = Nae Es a ae — 
Addr(3:0) | nn f noo f mot’ pooh ant? — 

"SALLE 
eet | tt tL ae 
a ee 
TTT Ty t+ 
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u ae ee ee ee 
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Diag 4 Cached? VD 


Start Extended RdCEn Sample RdCEn Sample RdCEn Sample Ack/ Sample New 
Read Address Data Data Data RdCEn Data Transaction 
2905 drw 16 


Figure 14. Mini-burst read of 32-bit datum from 8-bit wide memory port 
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Run/ 
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ALE 
DataER 
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Figure 15. R3041 Quad Word Read 
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| Figure 16 (a). Quad Word Read to 16-bit wide Memory Port 
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| | ‘Refill Refill/ Refill 
. Refill/ Streany © Stream/ Streanv » 
— Fixup ror is 
Stall | Stall | | Fixup | ne | Eee 
| Word 0 Word 1 Word 2 Word 3 
_. TAC SA 
- i E 7 E 
Rd © = a * - | 
Hk tata Pr 
noger Ser Sa pi el, i Sel 
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Ack/ Sample RdCEn Sample RdCEn Sample RdCEn Sample New 
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Figure 16 (b). End of Quad Word read from 16-bit Wide Memory Port 
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Figure 17. Basic Write to 32-bit Memory Port 
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Figure 18. Tri-Byte Mini-burst Write to 8-bit Port 
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Figure 19. Request and Relinquish of R3041 Bus to External Master 
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Figure 20. R3041 Regaining Bus Mastership. 
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Figure 21. R3041 DMA Pulse Protocol 
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Figure 22. Synchronized Interrupt Input Timing 


Run Cycle Exception Vector 


Phi 
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t30 31 3 2905 drw 26 
Figure 23. Direct Interrupt Input Timing 
Run Cycle Capture BrCond BCzT/F Instruction 

Phi 

SysClk 
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Figure 24. Synchronized Branch Condition Input Timing 
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Figure 25. TC Output 


84 LEAD PLCC (SQUARE) 


45° x .045 


D3/E3 
D2/E2 





SEATING PLANE 
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NOTES: 

1. All dimensions are in inches, unless otherwise noted. 

BSC—Basic lead Spacing between Centers. 

D & E do not include mold flash or protutions. 

. Formed.leads shall be planar with respect to one another and within .004” at the seating plane. 
ND & NE represent the number of leads in the D & E directions respectively. 

D1 & E1 should be measured from the bottom of the package. 

. PLCC is pin & form compatible with MQUAD; the MQUAD package is used in other R3051 
family members. 
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ORDERING INFORMATION 
XXXXX - XX X X 
[DT —_— 
Device Type Speed Package  Process/ 
Temp. Range 
Blank Commercial Temperature Range 
‘J’ 84-Pin PLCC 
‘16 16.67 MHz 
‘20' 20.0 MHz 
79R3041 Integrated RISController for 
Low-cost Systems . 
2905 drw 29 
VALID COMBINATIONS 
IDT 79R3041 - 16, 20J PLCC Package, Commercial Temperature Range 
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Integrated Device Technology, Inc. 


-| IDT79R3051/79 R3052 mc 


INTEGRATED IDT79R3052™, 79R3052E . 
RIiSControllers™ | ‘. a 






> IDT79R3051™, 79R3051E- 











FEATURES: = = | | 
« Instruction set compatible with IDT79R3000A and 
IDT79R3001 MIPS RISC CPUs 


- High level of integration minimizes system cost, power 


consumption a 

— IDT79R3000A /IDT79R3001 RISC Integer CPU 

— R3051 features 4kB of Instruction Cache 

— R3052 features 8kB of Instruction Cache 

— All devices feature 2kB of Data Cache 

— “E” Versions (Extended Architecture) feature full 
function Memory Management Unit, including 64- 
entry Translation Lookaside Buffer (TLB) 

— 4-deep write buffer eliminates memory write stalls 

— 4-deep read buffer supports burst refill from slow 
memory devices 


— On-chip DMA arbiter | 
'-— Bus Interface minimizes design complexity 
Single clock input with 40%-60% duty cycle 
- Direct interface to R3720/21/22 RISChipset™ 
35 MIPS, over 64,000 Dhrystones at 40MHz 
Low cost 84-pin PLCC packaging that's pin/package com- 
patible with thermally-enhanced 84-pin MQUAD. 
Flexible bus interface allows simple, low cost designs 
20, 25, 33, and 40MHz operation 
Complete software support 
— Optimizing compilers 
— Real-time operating systems 
— Monitors/debuggers 
— Floating Point Software. . -.. 
— Page Description Languages 
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Figure 1. R3051 Family Block Diagram 
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INTRODUCTION 


The IDT IDT79R3051 family is a series of high-perfor- 
mance 32-bit microprocessors featuring a high level of inte- 
gration, and targeted to high-performance but cost sensitive 
embedded processing applications. The IDT79R3051 family 
is designed to bring the high-performance inherent in the 
MIPS RISC architecture into low-cost, simplified, power 
sensitive applications. 

Functional units were integrated onto the CPU core in order 
to reduce the total system cost, without significantly degrading 
system performance. Thus, the IDT79R3051 family is able to 
offer 35 MIPS of integer performance at 40MHz without 
requiring external SRAM or caches. 

Further, the [DT79R3051 family brings dramatic power 
reduction to these embedded applications, allowing the use of 
low-cost packaging for devices up to 25 MHz. The IDT79R3051 
family allows customer applications to bring maximum per- 
formance at minimum cost. 

Figure 1 shows a block level representation of the functional 
units within the IDT79R3051 family. The IDT79R3051 family 
could be viewed as the embodiment of a discrete solution built 
around the IDT79R3000A or IDT79R3001. However, by 
integrating this functionality on a single chip, dramatic cost 
and power reductions are achieved. 

Currently, there are four members of the IDT79R3051 

family family. All devices are pin and software compatible: the 
differences lie in the amount of instruction cache, and in the 
memory management capabilities of the processor: 
The IDT79R3052"E” incorporates 8kB of Instruction Cache, 
and features a full function memory management unit 
(MMU) including a 64-entry fully-associative Translation 
Lookaside Buffer (TLB). This is the same memory man- 
agement unit incorporated in the IDT79R3000A and 
IDT79R3001. 


The IDT79R3052 also incorporates 8kB of Instruction Cache. 
However, the memory management unit is a much simpler 
subset of the capabilities of the enhanced versions of the 
architecture, and in fact does not use a TLB. 


The IDT79R3051"E” incorporates 4kB of Instruction Cache. 
Additionally, this device features the same full function 
MMU (including TLB file) as the IDT79R3052"E”, and 
IDT79R3000A. 

The IDT79R3051 incorporates 4kB of Instruction Cache, 
and uses the simpler memory management model of the 
IDT79R3052. 

An overview of the functional blocks incorporated in these 
devices follows. 


CPU Core 


The CPU core is a full 32-bit RISC integer execution 
engine, capable of sustaining close to single cycle execution 
rate. The CPU core contains a five stage pipeline, and 32 
orthogonal 32-bit registers. The IDT79R3051 family imple- 
ments the MIPS ISA. In fact, the execution engine of the 
IDT79R3051 family is the same as the execution engine of the 
IDT79R3000A (and IDT79R3001). Thus, the IDT79R3051 
family is binary compatible with those CPU engines. 


The execution engine of the IDT79R3051 family uses a 
five-stage pipeline to achieve close to single cycle execution. 
A new instruction can be started in every clock cycle; the 
execution engine actually processes five instructions con- 
currently (in various pipeline stages). Figure 2 shows the 
concurrency achieved by the IDT79R3051 family pipeline. 
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Figure 2. R3051 Family 5-Stage Pipeline 


System Control Co-Processor 


The R3051 family also integrates on-chip the System 
Control Co-processor, CPO. CPO manages both the excep- 
tion handling capability of the IDT79R3051 family, as well as 
the virtual to physical mapping of the IDT79R3051 family. 

There are two versions of the IDT79R3051 family architec- 
ture: the Extended Architecture Versions (the IDT79R3051E 
and IDT79R3052E) contain a fully associative 64-entry TLB 
which maps 4kB virtual pages into the physical address 
space. The virtual to physical mapping thus includes kernel 
segments which are hard mapped to physical addresses, and 
kernel and user segments which are mapped on a page basis 
by the TLB into anywhere within the 4gB physical address 
space. In this TLB, 8 page translations can be “locked” by the 
kernel to insure deterministic response in real-time applications. 
These versions thus use the same MMU structure as that 
found in the IDT79R3000A and IDT79R3001. Figure 3 shows 
the virtual to physical address mapping found in the Extended 
Architecture versions of the processor family. 

The Extended Architecture devices allow the system 
designer to implement kernel software to dynamically manage 
User task utilization of memory resources, and also allow the 
Kernel to effectively “protect” certain resources from user 
tasks. These capabilities are important in a number of 
embedded applications, from process control (where resource 
protection may be extremely important) to X-Window display 
systems (where virtual memory management is extremely 
important), and can also be used to simplify system debugging. 
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Figure 3. Virtual to Physical Mapping of Extended Architecture Versions 


The base versions of the architecture (the IDT79R3051 
and IDT79R3052) remove the TLB and institute a fixed 
address mapping for the various segments of the virtual 
address space. The base processors support distinct kernel 
and user mode operation without requiring page management 
software, leading to a simpler software model. The memory 
mapping used by these devices is illustrated in Figure 4. Note 
that the reserved address spaces shown are for compatibility 
with future family members; in the current family members, 
references to these addresses are translated in the same 
fashion as their respective segments, with no traps or ex- 
ceptions taken. 
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When using the base versions of the architecture, the 
system designer can implement a distinction between the 
user tasks and the kernel tasks, without having to execute 
page management software. This distinction can take the 
form of physical memory protection, accomplished by address 
decoding, or in other forms. In systems which do not wish to 
implement memory protection, and wish to have the kernel 
and user tasks operate out of a single unified memory space, 
upper address lines can be ignored by the address decoder, 
and thus all references will be seen in the lower gigabyte of the 
physical address space. 


PHYSICAL 


Kernel Cacheable 


Tasks 1024 MB 
Kernel/User 
Cacheable 2048 MB 
Tasks 
| Inaccessible 512 MB 
Kernel Boot 
and |/O 512 MB 
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Figure 4. Virtual to Physical Mapping of Base Architecture Versions 
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Clock Generation Unit 


The IDT79R3051 family is driven from a single input clock, 
capable of operating in a range of 40%-60% duty cycle. On- 
chip, the clock generator unit is responsible for managing the 
interaction of the CPU core, caches, and bus interface. The 
clock generator unit replaces the external delay line required 
in IDT79R3000A and IDT79R3001 based applications. 


Instruction Cache 


The current family includes two different instruction cache 
sizes: the IDT79R3051 family (the IDT79R3051 and 
IDT79R3051E) feature 4kB of instruction cache, and the 
IDT79R3052 and IDT79R3052E each incorporate 8kB of 
Instruction Cache. For all four devices, the instruction cache 
is organized as a line size of 16 bytes (four words). This 
relatively large cache achieves a hit rate well in excess of 95% 
in most applications, and substantially contributes to the 
performance inherent in the IDT79R3051 family. The cache 
is implemented as a direct mapped cache, and is capable of 
caching instructions from anywhere within the 4gB physical 
address space. The cache is implemented using physical 
addresses (rather than virtual addresses), and thus does not 
require flushing on context switch. 


Data Cache 


All four devices incorporate an on-chip data cache of 2kB, 
organized as a line size of 4 bytes (one word). This relatively 
large data cache achieves hit rates well in excess of 90% in 
most applications, and contributes substantially to the perfor- 
mance inherent in the IDT79R3051 family. As with the 
instruction cache, the data cache is implemented as a direct 
mapped physical address cache. The cache is capable of 
mapping any word within the 4gB physical address space. 


Bus Interface Unit 


The IDT79R3051 family uses its large internal caches to 
provide the majority of the bandwidth requirements of the 
execution engine, and thus can utilize a simple bus interface 
connected to slow memory devices. 

The IDT79R3051 family bus interface utilizes a 32-bit 
address and data bus multiplexed onto a single set of pins. 
The bus interface unit also provides an ALE signal to de- 
multiplex the A/D bus, and simple handshake signals to 
process processor read and write requests. In addition to the 
read and write interface, the IDT79R3051 family incorporates 
a DMA arbiter, to allow an external master to control the 
external bus. 

The IDT79R3051 family incorporates a 4-deep write buffer 
to decouple the speed of the execution engine from the speed 
of the memory system. The write buffers capture and FIFO 
processor address and data information in store operations, 
and presents it to the bus interface as write transactions at the 


rate the memory system can accommodate. 


The IDT79R3051/52 read interface performs both single 
word reads and quad word reads. Single word reads work with 
a simple handshake, and quad word reads can either utilize 
the simple handshake (in lower performance, simple systems) 


or utilize a tighter timing mode when the memory system can 


The data cache is implemented as a write through cache, | 


to insure that main memory is always consistent with the 
internal cache. In order to minimize processor stalls due to 
data write operations, the bus interface unit incorporates a 4- 
deep write buffer which captures address and data at the 


processor execution rate, allowing it to be retired to main — 


memory at a much slower rate without impacting system 
performance. 


5.5 


burst data at the processor clock rate. Thus, the system 
designer can choose to utilize page or nibble mode DRAMs 
(and possibly use interleaving), if desired, inhigh-performance 
systems, or use simpler techniques to reduce complexity. 

, In order to accommodate slower quad word reads, the 
IDT79R3051 family incorporates a 4-deep read buffer FIFO, 
so that the external interface can queue up data within the 
processor before releasing it to perform a burst fill of the 
internal caches. Depending on the cost vs. performance 
tradeoffs appropriate to a given application, the system design 
engineer could include true burst Support from the DRAM to 
provide for high-performance cache miss processing, or uti- 
lize the read buffer to process quad word reads from slower 
memory systems. . 
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SYSTEM USAGE | 


The IDT79R3051 family has been specifically designed to 
easily connect to low-cost memory systems. Typical low-cost 
memory systems utilize slow EPROMs, DRAMs, and appli- 
cation specific peripherals. These.systems may also typically 
contain large, slow static RAMs, although the IDT79R3051 
family has been designed to not specifically require the use of 
external SRAMs..  - | - 4 

Figure 5 shows a typical system block diagram. Transpar- 
ent latches are used to de-multiplex the IDT79R3051/52 
address and data busses from the A/D bus. The data paths 


ern . 
Clk2xIn 
Int(5:0) 
| BrCond(3:0) 
BusReq 
BusGnt 


AD(31:0) 
















FCT373T 


FCT245T 





IDT R3051 Family 
RiSController 


between the memory system elements and the R3051 family 
A/D bus is managed by simple octal devices. A small set of 
simple PALs can be used to control the various data path 
elements, and to control the handshake between the memory 
devices and the CPU. | 7 = 

_ Alternately, the memory interface canbe constructed using 
the IDT79R3051 family RISChipset, which includes DRAM 
control, data path control for interleaved memories, and other 
general memory and system interface control functions. These 
devices are described in separate data sheets. Figure 6 
illustrates a simple system constructed using the R3051 
RISChipset. : | | 









~Burst/ 
RdCEn WrNear 





| Wr 
ALE Addr(3:2) SysCik Rd Ack DataEn BErr 








Memory and Interface 
Control PALs 









Address 
Decode 
PAL 








I/O Devices/ 


Peripherals 


System I/O 
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Figure 5. Typical R3051 Family Based System 
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Figure 6. R3051 Family Chip Set Based System 
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DEVELOPMENT SUPPORT 
The IDT79R3051 family is supported by a rich set of 
development tools, ranging from system simulation tools 


through prom monitor support, logic analysis tools, and ee 


system modules. 


Figure 7 is an overview of the system development process | 


typically used when developing IDT79R3051 family-based 
applications. The IDT79R3051 family is supported by pow- 
erful tools through all phases of project development. These 
tools allow timely, parallel development of hardware and 
software for IDT79R3051/52 based applications, and include 
tools such as: 

¢ A program, Cache-3051, which allows the performance of 


an IDT79R3051 family based system to be modeled and 


understood without requiring actual hardware. 
Sable, an instruction set simulator. 


in optimizing compiler technology. 


IDT Cross development tools, available | in a variety of 
development environments. 


System 


Optimizing compilers from MIPS, the sdkiowicdeed ieader 


The high-performance IDT floating point library software, 


which has been integrated into the compiler toolchain to 
allow software floating point to replace hardware floating 


. point without modifying the original source code. 


e 


@ 


° 


The IDT Evaluation Board, which includes RAM, EPROM, 
/O, and the IDT Prom Monitor. 


The IDT Laser Printer System board, which directly drives 
a low-cost print engine, and runs Microsoft Truelmage™ 
Page Description Language on top of PeerlessPage™ 
Advanced Printer Controller BIOS. 


-Adobe PostScript™ Page Description Language, ported to 


the R3000 instruction set, runs on the IDT79R3051 family. 
The IDT Prom Monitor, which implements a full prom 


_. monitor. (diagnostics, remote debug support, peek/poke, 
- @tc.). 


An In-Circuit Emulator, developed and sold by Embedded 
Performance, Inc. | 


; System | System 
Architecture Development Integration 
Evaluation Phase and Verification 





SABLE Simulator 
'  DBG Debugger’ 
PIXIE Profiler 

MIPS Compller Suite 

Stand-Alone Libraries _ 

Floating Point Library 
Cross Development Tools 

Adobe PostScript™ PDL 


Cache-R305x 
Benchmarks 
Evaluation Board 
Laser Printer System 


:| MicroSoft Truelmage™ PDL. - 
s Ada 


Logic Analysis 
Diagnostics 
IDT PROM Monitor 
Remote Debug 
Real-Time OS 
in-Circult Emulator 


Cache-R305x 
Hardware Models 
’ General CAD Tools 
RISC Sub-systems 
Evaluation Board _ 
Laser Printer System — 
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Figure 7. R3051 Family Development Toolchain 
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PERFORMANCE OVERVIEW 


The IDT79R3051 family achieves a very high-level of 
performance. This performance is based on: 

- An efficient execution engine. The CPU performs ALU 
operations and store operations at a single cycle rate, and 
has an effective load time of 1.3 cycles, and branch execu- 
tion rate of 1.5 cycles (based on the ability of the compilers 
to avoid software interlocks). Thus, the execution engine 
achieves over 35 MIPS performance when operating out of 
cache. 


¢ Large on-chip caches. The IDT79R3051 family contains 
caches which are substantially larger than those on the 
majority of today’s embedded microprocessors. These 
large caches minimize the number of bus transactions 
required, and allow the R3051 family to achieve actual 
sustained performance very close to Its peak execution rate. 


*Autonomous multiply and divide operations. The 
IDT79R3051 family features an on-chip integer multiplier/ 
divide unit which is separate from the other ALU. This allows 
the IDT79R3051 family to perform multiply or divide op- 
erations in parallel with other integer operations, using a 
single multiply or divide instruction rather than “step” op- 
erations. 


- Integrated write buffer. The IDT79R3051 family features 
a four deep write buffer, which captures store target ad- 
dresses and data at the processor execution rate and retires 
itto main memory at the slower main memory access rate. 
Use of on-chip write buffers eliminates the need for the 
processor to stall when performing store operations. 


¢ Burst read support. The |IDT79R3051 family enables the 
system designer to utilize page mode or nibble mode RAMs 
when performing read operations to minimize the main 
memory read penalty and increase the effective cache hit 
rates. 


These techniques combine to allow the processor to achieve 
35 MIPS integer performance, and over 64,000 dhrystones at 
40MHz without the use of external caches or zero wait-state 
memory devices. 


SELECTABLE FEATURES 
The IDT79R3051 family allows the system designer to 

configure some aspects of operation. These aspects are 

established when the device is reset, and include: 

¢ Big Endian vs. Little Endian operation: The part can be 
configured to operate with either byte ordering convention, 
and in fact may also be dynamically switched between the 
two conventions. This facilitates the porting of applications 
from other processor architectures, and also permits inter- 
communications between various types of processors and 
databases. 

«Data cache refill of one or four words: The memory 
system must be capable of performing 4 word transfers to 
satisfy cache misses. This option allows the system designer 
to choose between one and four word refill on data cache 
misses, depending on the performance each option brings 
to his application. 


THERMAL CONSIDERATIONS 


The IDT79R3051 family utilizes special packaging tech- 
niques to improve the thermal properties of high-speed pro- 
cessors. Thus, all versions of the IDT79R3051 family are 
packaged in cavity down packaging. 

The lowest cost members of the family use a standard 
cavity down, injection molded PLCC package (the "J" pack- 
age). This package, coupled with the power reduction tech- 
niques employed in the design of the IDT79R3051 family, 
allows operation at speeds to 25MHz. However, at higher 
speeds, additional thermal care must be taken. 

Thus, the IDT79R3051 family is also available in the 
MQUAD package (the "MJ" package), which is an all alumi- 
num package with the die attached to a normal copper lead- 
frame mounted to the aluminum casing. The MQUAD allows 
for a more efficient thermal transfer between the die and the 
case of the part due to the heat spreading effect of the 
aluminum. The aluminum offers less internal resistance from 
one end of the package to the other, reducing the temperature 
gradient across the package and therefore presenting a 
greater area for convection and conduction to the PCB for a 
given temperature. Even nominal amounts of airflow will 
dramatically reduce the junction temperature of the die, result- 
ing in cooler operation. The MQUAD package is available at 
all frequencies, and is pin and form compatible with the PLCC 
package. Thus, designers can choose to utilize this package 
without changing their PCB. 

Finally, the IDT79R3051 family is also available in a cavity 
down PGA with integral thermal slug. As with the MQUAD | 
package, this package is highly thermally efficient, and is 
appropriate for use in more extreme temperature conditions, 
such as military applications. 

The members of the IDT79R3051 family are guaranteed in 
a case temperature range of 0°C to +95°C. The type of 
package, speed (power) of the device, and airflow conditions, 
affect the equivalent ambient conditions which meet this 
specification. 

The equivalent allowable ambient temperature, TA, can be 
calculated using the thermal resistance from case to ambient 
(OcA) of the given package. The following equation relates 
ambient and case temperature: 

TA=TCc-P*@CA 
where P is the maximum power consumption at hot tempera- 
ture, calculated by using the maximum Icc specification for the 
device. 

Typical values for OcA at various airflows are shown in 
Table 1 for the various packages. 








acne a0] 800 95 
| "MJ" Package” | 22 | 14 | 12 | 
Tran ecige | oo [1 [o — 


Table 1. Thermal Resistance (Oca) at Various Airflows 
(*estimated: final values tbd) 
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PIN DESCRIPTION 


PIN NAME - 4 oe : DESCRIPTION 


‘Address/Data: A 32-bit time multiplexed bus which indicates the desired address fora bus transaction 
in one phase, and whichis used to transmit data between the CPU and external EemOTy resources during 
the rest of the transfer. 


Bus transactions on this bus are logically separated into two phases: during the first phase, information 
about the transfer is presented to the memory system to be captured using the ALE output. This 
information consists of: 


-Address(31:4): The high-order address for the transfer is presented on A/D(31:4). 


BE(3:0): — These strobes indicate which bytes of the 32-bit bus will be involved in 
| the transfer, and are presented on A/D(3:0). 


During write cycles, the bus contains the data to be stored and is driven from the internal write buffer. 
On read cycles, the bus receives the data from the external resource, in either a single data | 
transaction or in a burst of four words, and places it into the on-chip read buffer. 


Low Address (3:2) A 2-bit bus which indicates which word is currently expected by the processor. 
Specifically, this two bit bus presents either the address bits for the single word to be transferred (writes 
or single datum reads) or functions as a two bit counter starting at ‘00’ for burst read operations. 


Addr(3:2) 


Diagnostic Pin 1. This output indicates whether the current bus read transaction is due to an on- 
chip cache miss, and also presents part of the miss address. The value output on unis pin is time 
multiplexed: 


Cached: During the phase in whichthe A/D bus presents sadiess information, this 
pin is an active high output which indicates whether the current read is 
~ a result of a cache miss. The value of this pin at this time in other than 

read cycles is undefined. : 


iag(1 
| Miss Address (3): =~ During the remainder of the read operation, this output presents 
a | address bit (3) of the address the processor was attempting to 
| . ' reference when the cache miss occurred. Regardless of whether a 
| ~ cache miss is being processed, this pin reports the transfer address 
. | ee during this time. . ; ee 
Diag(0) Diagnostic Pin 0. This output distinguishes cache misses due to instruction references from those — 
due to data references, and presents the remaining bit of the miss address. The value output on this 
: | pin is also time multiplexed: 


VD: . If the “Cached” Pin indicates a cache miss, then a high on this pin at this 
| time indicates an instruction reference, and a low indicates a data 
reference. If the readis not due to a cache miss but rather an uncached 
reference, then this pin is undefined during this phase. 


Miss Address (2): _ During the remainder of the read operation, this output presents 
address bit (2) of the address the processor was attempting to 
reference when the cache miss occurred. Regardless of whether a. 
cache miss is being processed, this pin reports the transfer address 
during this time. . 





Address Latch Enable: Used to indicate that the AD bus contains valid address information for 
the bus transaction. This signali is used by external logic to capture the address for the transfer, typically 
using transparent latches. 


ALE 





External Data Enable: This signal indicates that the A/D bus is no longer being driven by the processor 
during read cycles, and thus the external memory system may enable the drivers of the memory 


system onto this bus without having a bus conflict occur. During write cycles, or when no bus | 
transaction is occurring, this signal is negated, thus disabling the external memory drivers. 
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PIN DESCRIPTION (Continued): © 


Burst Transfer/Write Near: On read transactions, the Burst signal indicates that the current bus read 
is requesting a block of four contiguous words from memory. This signal is asserted only in read cycles 
due to cache misses; it is asserted for all |-Cache miss read cycles, and for D-Cache miss read cycles 
if selected at device reset time. 


On write transactions, the WrNear output tells the external memory system that the bus interface unit 
is performing back-to-back write transactions to an address within the same 256 word page as the prior 
write transaction. This signal is useful in memory systems which employ page mode or static column 
DRAMs, and allows near writes to be retired quickly. 


[AG fc Read: An output which indicates that the current bus transaction is a read. 
ae rae Write: An output which indicates that the current bus transaction is a write. 


Ack Acknowledge: An input which indicates to the device that the memory system has sufficiently 
_ processed the bus transaction, and that the CPU may either terminate the write cycle or 
process the read data from this read transfer. 


Read Buffer Clock Enable: An input which indicates to the device that the memory system has 
placed valid data on the A/D bus, and that the processor may move the data into the on-chip Read 


System Reference Clock: An output from the CPU which reflects the timing of the internal 
processor "Sys" clock. This clock is used to control state transitions in the read buffer, write buffer, 
memory controller, and bus interface unit. 


DMA Arbiter Bus Request: An input to the device which requests that the CPU tri-state its bus 
interface signals so that they may be driven by an external master. 


DMA Arbiter Bus Grant. An output from the CPU used to acknowledge that a BusReg has been 
detected, and that the bus is relinquished to the external master. | 


SBrCond(3:2) Branch Condition Port: These external signals are internally connected to the CPU signals: 

BrCond(1:0) CpCond(3:0). These signals can be used by the branch on co-processor condition instructions as input 
ports. There are two types of Branch Condition inputs: the SBrCond inputs have special internal 
logic to synchronize the inputs, and thus may be driven by asynchronous agents. The direct Branch 
Condition inputs must be driven synchronously. 





Bus Error: Input to the bus interface unit to terminate a bus transaction due to an external bus error. 
This signal is only sampled during read and write operations. If the bus transaction is a read operation, 
then the CPU will take a bus error exception. 


Processor Interrupt: During normal operation, these signals are logically the same as the Int(5:0) 
signals of the R3000. During processor reset, these signals perform mode initialization of the CPU, but 
in a different (simpler) fashion than the interrupt signals of the R3000. 


There are two types of interrupt inputs: the Sint inputs are internally synchronized by the processor, 
and may be driven by an asynchronous external agent. The direct interrupt inputs are not internally 
synchronized, and thus must be externally synchronized to the CPU. The direct interrupt inputs have 
one cycle lower latency than the synchronized interrupts. 


Clk2xIn Master Clock Input: This is a double frequency input used to control the timing of the CPU. 


Master Processor Reset: This signal initializes the CPU. Mode selection is performed during 
the last cycle of Reset. 


Reserved: These five signal pins are reserved for testing and for future revisions of this device. 
Users must not connect these pins. 
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ABSOLUTE MAXIMUM RATINGS" 9) 
[Symbol] _ Rating __| Commercial 


VterM | Terminal Voltage |—0.5 to +7.0 
with Respect 
to GND 


Tc Operating Case 0 to +95 —55 to +125] °C 
Temperature 
Tsias | Temperature —55 to +125 
| Under Bias | : 


TSTG Storage —55 to +125 | -65to +155| °C 
Temperature 












—0.5to+7.0] 

















NOTES: : 2874 tbl 04 

1. Stresses greater than those listed under ABSOLUTE MAXIMUM RATINGS 
may cause permanent damage to the device. This is a stress rating only 
and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 

_ implied. Exposure to absolute maximum rating conditions for extended 

periods may affect reliability. 

2. ViN minimum = —3.0V for pulse width less than 15ns. 
Vin should not exceed Vcc +0.5V. 

3. Notmore than one outputshould be shorted ata time. Duration of the short 
should not exceed 30 seconds. 


AC TEST CONDITIONS 











vis [put Low Votage | — [04 |v 
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Input Voltage -0.5to +7.0 | -0.5 to +7.0 7 


RECOMMENDED OPERATING 


"tannery unr] TEMPERATURE AND SUPPLY VOLTAGE > 
| Grade | Temperature _ 


Military | -55°C to +125°C 
(Case) 
Commercial 0°C to +95°C 
| (Case) 










| GND | Vcc __ 
5.0+10% 
5.0 +5% 
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- OUTPUT LOADING FOR AC TESTING 


5.5 





To Device 
Under Test 
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DC ELECTRICAL CHARACTERISTICS (Tc = 0°C to +95°C, Vcc = +5.0V +5%) 


Output HIGH Voltage Vcc = Min., loH} = 4mA 65 
Output LOW Voltage Vcc = Min., lou = 4mA 
Input HIGH Voltage() 

Cc 








Unit 











0 


—_ 
oO 


CIN | Input Capacitance 
Output Capacitance(4 


OH 
OL 
IH 
IL 
(2,3) 
IN (4) 
l ) 
ec = 
NH 
NL 
loz 


3 
2. 
3 


— 
(o>) 


Es 

iin. Max | Min. Mex 

<_ 

<a 

—_ 

[= oe [— 08 

. = 

s[—_oa[— oa] — ow 

0 z 

f= 10 [- | — 

o[ = aso | — 600 — 700 

. 
: 


|i | Input HIGH Leakage Vin = VCC — 100 100 
}ii | Input LOW Leakage Vit = GND -100  — —100 100 — 
|loz | Output Tri-state Leakage | VoH=2.4V,VoL=0.5V  |-100 100]-100 100 {-100 100}-100 100 | p 


NOTES: 2874 tbl 07 
1. ViL Min. = -3.0V for pulse width less than 15ns. Vit should not fall below —0.5V for larger periods. 

2. VIHs and Vits apply to Clk2xIn and Reset. 

3. Vit should not be held above Vcc + 0.5V. 

4. Guaranteed by design. 
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AC ELECTRICAL CHARACTERISTICS "1! 2:3) (Tc = 0°C to 495°C, Voc=4+5.0V45%) 


Signals Description [Min Max. | Min. Max. | Min. Max. | Min. Max. | 
5 tee Be ss * ies 3 oa, 
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A/D Driven from SysCik rising(*) 
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121 Pulse Width Low 
Clock Period 
Pulse Width from Vcc valid 





ei 
| jo 
—, 
mn 
| 


t5 
t7 
10 




















AT 

aS 

uo 

we) 

ns =] 312 |Z te 3/5 = | 5 S155 

ee 7) wo 1% TF oO 1H a n w H 1H 1H M - 








Oo 








oO 


~_ - - 
ah —_— —_ 


Q 
ra 
& 
vale 
5 


250 | 12.5 250 


200 


(oa) 
Nm 
(94) 
Oo 
N 
© 


—_ ond, 
ss _k 


1, pA on en | | || 
| || I qt ql | ° S| 


250 


-- 


22 
23 





nfo 
oO 1 
| | || 


200 

















t24 [Rest = (ti(i‘s*z Minimum Pulse Width 2 32 — 32 32 — 

125 | Rest = Set-up to SysCiK falling [5 | 
jo [im = Moe set-up to Reset rising as 
je7 int Mode hot trom Reset rising | 
Set-up to SysCIk falling 3. 
[20 | Sint, srcond 20 = 

t30 nt, BrCond Set-up to SysClk falling haa 3 — 


| 


t31 nt, BrCond Hold from SysClik falling 











Jin Brcond | Hold from SysCik falling | 
ys SysClk Pulse Width 2*t22  -2*t22 | 2*t22) = b*t22 | 2*t22 = 2*t22 | 2*t22 =. 2*t22 

t32 SysClk Clock High Time t22-—2 t224+2|t22-—2 t22+2}t22-—1 t22+1);t22-1 t22+1 

t33 SysClk Clock Low Time t22-—2 t224+2|t22—2 t22+2|t22-1 t22+1}t22-1 t22+1 

tderate All outputs Timing deration for loading — 0.5 _— 0.5 

over 25pf(4: 5) | 

NOTES: 2874 tbl 08 
1. All timings referenced to 1.5V, with a rise and fall time of less than 2.5ns. 
2. All outputs tested with 25pF loading. 
3. The AC values listed here reference timing diagrams contained in the R3051 Family Hardware User's Manual. 
4. Guaranteed by design. 
5. This parameter is used to derate the AC timings according to the loading of the system. This parameter provides a deration for loads over the specified 


test condition; that is, the deration factor is applied for each 25pF over the specified test load condition. 
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PIN CONFIGURATIONS 
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84-Pin PLCC/MQUAD 
Top View 


NOTE: 
Reserved Pins must not be connected. 
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PIN CONFIGURATIONS (CONTINUED) 


R3051 
84-Pin Ceramic Pin Grid Array 


(Cavity Down) 


Bottom View 
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84-Pin PGA with Integral Thermal Slug 
BottomView 


NOTE: 
Reserved Pins must not be connected. 
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Figure 8 R3051 Family Clocking 
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Figure 9. Power-On Reset Sequence 
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2874 drw 13 
Figure 10. Warm Reset Sequence 
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Figure 11. Mode Selection and Negation of Reset 
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Figure 12. Single Datum Read in R3051 Family 
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Figure 13. R3051 Family Burst Read 
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Figure 14 (a). Start of Throttled Quad Read 
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Figure 14 (b). End of Throttled Quad Read 
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Figure 16. Request and Relinquish of R3051 Family Bus to External Master 
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Figure 17. R3051 Family Regaining Bus Mastership 
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Figure 18. Synchronized Interrupt Input Timing 
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_ Figure 19. Direct Interrupt input Timing 
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Figure 21. Direct Branch Condition Input Timing 
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84-PIN PGA (CAVITY DOWN) 


oBi1 
4 5 6;7 8 9 10/11 12 


> WovuovMmmntg®da - re xn -r Zz 





SEATING PLANE 





NOTES: 
. Ail dimensions are in inches, unless otherwise noted. | . 2874 drw 26 
BSC—Basic lead Spacing between Centers 

Symbol “M” represents the PGA matrix size. 

Symbol “N” represents the number of pins. 

. Chamfered corners are IDT’s option. 

. Shaded area indicates integral metallic heat sink. 


OAR wr 


D1/E1 1.100 BSC 
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84 LEAD PLCC/MQUAD” (SQUARE) 


45° x .04 





. ord be ~ SEATING PLANE 


. 2874 drw 27 


NOTES: 

. All dimensions are in inches, unless otherwise noted. 

BSC—Basic lead Spacing between Centers. 

D & E do not include mold flash or protutions. 

. Formed leads shall be planar with respect to one another and within .004” at the seating plane. 
ND & NE represent the number of leads in the D & E directions respectively. 

. D1 & E1 should be measured from the bottom of the package. | 

. MQUAD is pin & form compatible with PLCC. =: wt, 


NOOR WD = 
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ORDERING INFORMATION 





si XXXXX - XX X X 
Device Type Speed Package Process/ 
Temp. Range 
Blank 
'MJ' 
'G' 
'90' 
15 
'33' 
'40' 
79R3051 
79R3051E 
79R3052 
79R3052E 
VALID COMBINATIONS 
IDT 79R3051 - 20, 25 All packages 

79R3051E - 20, 25 All packages 

79R3052 - 20, 25 All packages 

79R3052E - 20, 25 All packages 


79R3051 - 33, 40 
79R3051E - 33, 40 
79R3052 - 33, 40 
79R3052E - 33, 40 


PGA, MJ Packages Only 
PGA, MJ Packages Only 
PGA, MJ Packages Only 
PGA, MJ Packages Only 


5.5 


Commercial Temperature Range 


84-Pin PLCC 
84-Pin MQUAD 
84-Pin PGA with Integral Thermal Slug 


20.0 MHz 
25.0 MHz 
33.33 MHz 
40.0 MHz 


4kB Instruction Cache, No TLB 
4kB Instruction Cache, With TLB 
8kB Instruction Cache, NoTLB | 
8kB Instruction Cache, With TLB 
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IDT 79R3081™, 79R3081E 
IDT 79R3081L, 79R3081LE 


IDT79R3081 
RiSController™ 


Integrated Device Technology, Inc. 











FEATURES 

¢ Instruction set compatible with IDT79R3000A, R3051, 
and R3500 RISC CPUs 

High level of integration minimizes system cost 

— R3000A Compatible CPU 

— R3010A Compatible Floating Point Accelerator 

— Optional R3000A compatible MMU 

— Large Instruction Cache 

— Large Data Cache 

— Read/Write Buffers 

35 VUPS at 40MHz 

— 64,000 Dhrystones 

— 11 MFlops 

Flexible bus interface allows simple, low cost designs 
Optional 1x or 2x clock input | 
20 through 40MHz operation 

"L" version operates at 3.3V 


e 


e 


e 


e 


Large on-chip caches with user configurability 

— 16kB Instruction Cache, 4kB Data Cache 

— Dynamically configurable to 8kB Instruction 
Cache, 8kB Data Cache 

— Parity protection over data and tag fields 

Low cost 84-pin packaging 

Superset pin- and software-compatible with R3051 

Multiplexed bus interface with support for low-cost, low- 

speed memory systems with a high-speed CPU 

On-chip 4-deep write buffer eliminates memory write 

stalls 

On-chip 4-deep read buffer supports burst or simple 

block reads 

On-chip DMA arbiter 

Hardware-based Cache Coherency Support 

Programmable power reduction mode 

Bus Interface can operate at half-processor frequency 
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INTRODUCTION 

The IDT R3051 family is a series of high-performance 32- 
bit microprocessors featuring a high-level of integration, and 
targeted to high-performance but cost sensitive processing 
applications. The R3051 family is designed to bring the high- 
performance inherent in the MIPS RISC architecture into 
low-cost, simplified, power sensitive applications. 

Thus, functional units have been integrated onto the CPU 
core in order to reduce the total system cost, rather than to 
increase the inherent performance of the integer engine. 
Nevertheless, the R3051 family is able to offer 35 VUPS 
performance at 40MHz without requiring external SRAM or 
caches. 

The R3081 extends the capabilities of the R3051 family, by 
integrating additional resources into the same pin-out. The 
R3081 thus extends the range of applications addressed by 
the R3051 family, and allows designers to implement a single, 
base system and software set capable of accepting a wide 
variety of CPUs, according to the price/performance goals of 
the end system. 

In addition to the embedded applications served by the 
R3051 family, the R3081 allows low-cost, entry level computer 
systems to be constructed. These systems will offer many 


times the performance of traditional PC: systems, yet cost 


approximately the same. The R3081 is able to run any 
standard R3000A operation system, including ACE UNIX. 
Thus, the R3081 can be used to build a low-cost ARC 
compliant system, further widening the range of performance 
solutions of the ACE Initiative. 

An overview of this device, and quantitative electrical 
parameters and mechanical data, is found in this data sheet; 
consult the "H3081 Family Hardware User's Guide” for a 
complete description of this processor. 


DEVICE OVERVIEW 


As part of the R3051 family, the R3081 extends the offering 


of a wide range of functionality in a compatible interface. The 

R3051 family allows the system designer to implement a 

single base system, and utilize interface-compatible processors 

of various complexity to achieve the price-performance goals 
of the particular end system. 

Differences among the various family members pertain to 
the on-chip resources of the processor. Current family members 
include: 
¢ TheR3052E, which incorporates an 8kB instruction cache, 

a 2kB data cache, and full function memory management 

unit (MMU) including 64-entry fully associative Translation 

Lookaside Buffer (TLB). 

*« The R3052, which also incorporates an 8kB instruction 
cache and 2kB data cache, but does not include the TLB, 
and instead uses a simpler virtual to physical address 
mapping. 

¢ The R3051E, which incorporates 4kB of instruction cache 
and 2kB of data cache, along with the full TuGeOG MMU/ 
TLB of the R3000A. 


* The R3051, which incorporates 4kB of instruction cache 
and 2kB of data cache, but omits the TLB, and instead uses 

a simpler virtual to physical address mapping. 
¢ TheR3081E, which incorporates a 16kB instruction cache, 

a 4kB data cache, and full function memory management 

unit (MMU) including 64-entry fully associative Translation 

Lookaside Buffer (TLB). The cache on the R3081E is user 

configurable to an 8kB Instruction Cache and 8kB Data 

Cache. 
¢ The R3081, which incorporates a 16kB instruction cache, 

a 4kB data cache, but uses the simpler memory mapping 

of the R3051/52, and thus omits the TLB. The cache on the 

R3081 is user configurable to an 8kB Instruction Cache and 

8kB Data Cache. 

Figure 1 shows a block level representation of the functional 
units within the R3081E. The R3081E could be viewed as the 
embodiment of a discrete solution built around the R3000A 
and R3010A. However, by integrating this functionality on a 
single chip, dramatic cost and power reductions are achieved. 


CPU Core 

The CPU core is a full 32-bit RISC integer execution 
engine, capable of sustaining close to single cycle execution. 
The CPU corecontains a five stage pipeline, and 32 orthogonal 
32-bit registers. The R3081 uses the same basic integer 
execution core as the entire R3051 family, which is the 
R3000A implementation of the MIPS instruction set. Thus, the 
R3081 family is binary compatible with the R3051, R3052, 
R3000A, R3001, and R3500 CPUs. In addition, the R4000 
represents an upwardly software compatible migration path to 
still higher levels of performance. 

The execution engine in the R3081 uses a five- -Stage 
pipeline to achieve near single-cycle instruction execution 
rates. A new instruction can be initiated in each clock cycle; 
the execution engine actually processes five instructions 
concurrently (in various pipeline stages). Figure 2 shows the 
concurrency achieved in the R3081 execution pipeline. 
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Figure 2. R3081 5-Stage Pipeline 
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VIRTUAL 


* sete tek t bdae 
Kernel Mapped 
(kseg2) 
OxcO0O000000 | 
ee Kernel Uncached 
. (kseg1) 
Oxa0000000 
Kernel Cached 
| (kseg0) 
0x80000000 
User Mapped 
Cacheable 
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| Ox00000000 





PHYSICAL 


Physical 3548 MB 


Memory — 


Figure 3. Virtual to Physical Mapping of Extended Architecture Versions 


System Control Co-Processor | 

The R3081 family also integrates on-chip the System 
Control Co-processor, CPO. CPO manages both the exception 
handling capability of the R3081, as well as the virtual to 
physical address mapping. ; 

As with the R3051 and R3052, the R3081 offers two 
versions of memory management and virtual to physical 
address mapping: the extended architecture versions, the 
_ R8051E, R38052E, and R3081E, incorporate the same MMU 


as the R3000A. These versions contain a fully associative 64- 


entry TLB which maps 4kB virtual pages into the physical 
address space. The virtual to physical mapping thus includes 
kernel segments which are hard-mapped to physical 
addresses, and kernel and user segments which are mapped 
page by page by the TLB into anywhere in the 4GB physical 
address space. In this TLB, 8 pages can be “locked” by the 
kernel to insure deterministic response in real-time applications. 
Figure 3 illustrates the virtual to physical mapping found in the 
R3081E. 





The Extended architecture versions of the R3051 family 
(the R3051E, R3052E, and R3081E) allowthesystem designer 
to implement kernel software which dynamically manages 
User task utilization of system resources, and also allows the © 
Kernel to protect certain resources from User tasks. These 
capabilities are important in general computing applications 
such as ARC computers, and are also important in a variety of 
embedded applications, from process control (where protection 
may be important) to X-Window display systems (where 
virtual Memory management can be used). The MMU can 
also be used to simplify system debug. _—y 

R3051 family base versions (the R3051, R3052, and 
R3081) remove the TLB and institute a fixed address mapping 
for the various segments of the virtual address space. These 
devices still support distinct kernel and user mode operation, 
but do not require page management software, leading to a 
simpler software model. The memory mapping used by these 
devices is shown in Figure 4. Note that the reserved spaces 
are for compatiblity with future family members, which may 
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Kernel Cached — Gut ened 2048 MB — 
. (kseg0) Tasks 
0x80000000 
Renae Inaccessible — 512 MB 
| (kuseg) Kernel Boot f[ i 
000000000 and /O 512 MB 2889 drw 04 


Figure 4. Virtual to Physical Mapping of Base Architecture Versions 
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map on-chip resources to these addresses. References to 
these addresses in the R3081 will be translated in the same 
fashion as the rest of their respective segments, with no traps 
or exceptions signalled. 

When using the base versions of the architecture, the 
system designer can implement a distinction between the 
user tasks and the kernel tasks, without having to implement 
page management software. This distinction can be 
implemented by decoding the output physical address. In 
systems which do not need memory protection, and wish to 
have the kernel and user tasks operate out of the same 
memory space, high-order address lines can be ignored by 
the address decoder, and thus all references will be seen in 
the lower gigabyte of the physical address space. 


Floating Point Co-Processor 

The R3081 also integrates an R3010A compatible floating 
point accelerator on-chip. The FPA isa high performance co- 
processor (co-processor 1 to the CPU) providing separate 
add, multiply, and divide functional units for single and double 
precision floating point arithmetic. The floating point accelerator 
features low latency operations, and autonomous functional 
units which allow differing types of floating point operations to 
function concurrently with integer operations. The R3010A 
appears to the software programmer as a simple extension of 
the integer execution unit, with 16 dedicated 64-bit floating 
point registers (Software references these as 32 32-bit registers 
when performing loads or stores). Figure 5 illustrates the 
functional block diagram of the on-chip FPA. 
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Clock Generator Unit 

The R3081 is driven from a single input clock which can be 
either at the processor rated speed, or at twice that speed. On- 
chip, the clock generator unit is responsible for managing the 
interaction of the CPU core, caches, and bus interface. The 
R3081 includes an on-chip clock doubler to provide higher 
frequency signals to the internal execution core; if 1x clock 
mode is selected, the clock doubler will internally convert it to 
a double frequency clock. The 2x clock mode is provided for 
compatiblity with the R3051.The clock generator unit replaces 
the external delay line required in R3000A based applications. 


Instruction Cache 

The R3081 implements a 16kB Instruction Cache. The 
system may choose to repartition the on-chip caches, so that 
the instruction cache is reduced to 8kB but the data cache is 
increased to 8kB. The instruction cache is organized with a 
line size of 16 bytes (four entries). This large cache achieves 
hitrates in excess of 988% in most applications, and substantially 
contributes to the performance inherent in the R3081. The 
cache is implemented as a direct mapped cache, and is 
capable of caching instructions from anywhere within the 4GB 
physical address space. The cache is implemented using 
physical addresses (rather than virtual addresses), and thus 
does not require flushing on context switch. 

The instruction cache is parity protected over the instruction 
word and tag fields. Parity is generated by the read buffer 
during cache refill; during cache references, the parity is 
checked, and in the case of a parity error, a cache miss is 
processed. 
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Calf 
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Figure 5. FPA Functional Block Diagram 
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Data Cache 

~ The R8081 incorporates an on- er data cache of AKB, 
organized as a line size of 4:bytes (one word). The R3081 
allows the system to reconfigure the on-chip cache from the 
default 16kB |-Cache/4kB D-Cache to kB. of Instruction and 
8kB of Datacaches. . 

The relatively large data cache achieves hit rates in excess 
of 95% in most applications, and contributes substantially to 
the performance inherent in the R3081. As with the instruction 
cache, the data cache is implemented as.a direct mapped 
physical address cache. Thecacheis capable of mapping any 
word within the 4GB physical address space. 

The data cache is implemented as a write-through cache; 
to insure that main memory is always consistent with the 
internal cache. In order to minimize processor stalls due to 
data write operations, the bus interface unit incorporates a 4- 
deep write buffer which captures address and data at. the 
processor execution rate, allowing it tobe retired to main 
memory at a much slower rate without. impacting system 
performance. :Further, support has been provided to allow 
hardware: based data cache coherency in a multi-master 
environment, such as one utilizing DMA from I/O to memory. 

The data cache is parity protected over the data and tag 
fields. Parity is generated by the read buffer during cache 
refill; during cache references, the parity is checked, and inthe 
case of a parity error, a cache miss is nv eee ees 


ie. Bus Interface Unit 


The R3081 uses its large internal caches to sravide ‘fa 
majority of the bandwidth requirements of the execution 
engine, and thus can utilize a simple bus interface connected 
to slower memory devices. Alternately, a high-performance, 
low cost secondary cache can be implemented, allowing the 
processor to increase performance in systems where bus 
bandwidth is a performance limitation. 

As part of the R3051 family, the R3081 bus interface utilizes 
a 32-bit address and data bus multiplexed onto a single set of 
pins. The bus interface unit also provides an ALE (Address 


Latch Enable) output fees to de- eee the A/D bus, cea 
simple handshake signals to process. CPU read and write 
requests.:In-addition to the read and write interface, the R3051 
family incorporates a DMA arbiter, to allow an external master 


- to control the external bus... 


‘The R3081 also supports hardware based Ricke coherency 
during DMA writes. The R3081 can invalidate a specified line 
of data cache, or in. fact can Peon burst mivaligstone euning 
burst DMA writes.: 

The R3081 incorporates a 4- asepN write butter to decouple 
the speed of the execution engine from the speed of the 
memory system. The write buffers capture andFlFO processor 
address and data information in store operations, and present 
it to the bus interface: as write transactions at the rate the 
memory system can accommodate. 

The R3081 read interface performs: both single. datum 
reads and quad word reads. Single reads work with a simple 
handshake, and quad word reads can either utilize the simple 
handshake (in lower performance, simple systems) or utilize 
a tighter timing mode when the memory system can burst data 
at the processor clock rate. Thus, the system designer can 
choose to utilize page or nibble mode DRAMs (and possibly 
use interleaving, if desired, in high-performance systems), or 
use simpler techniques.to reduce complexity. 

In order to accommodate slower quad word reads, the 
R3081 incorporates a 4-deep read buffer FIFO, so that the 
external interface can queue up data within the processor 
before releasing it to perform a burst fill of the internal caches. 

The R3081 is R3051 superset compatible in its bus interface. 
Specifically, the R3081 has additional support to simplify the 
design of very high frequency systems. This support includes 
the ability to run the bus interface at one-half the processor 
execution rate, as well as the ability to slow the transitions 
between reads and writes to provide extra buffer disable time 
forthe memory interface. However, it is still possible to design 
a system which, with no modification to the PC Board or 
software, can accept either an R3051, R3052, or R3081. 
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SYSTEM USAGE 

The IDT R3051 family has been specifically designed to 
allow a wide variety of memory systems. Low-cost systems 
can use slow speed memories and simple controllers, while 
other designers may choose to incorporate higher frequencies, 
faster memories, and techniques such as DMA to achieve 
maximum performance. The R3081 includes specific support 
for high perfromance systems, including signals necessary to 
implement external secondary caches, and the ability to 
perform hardware based cache coherency in multi-master 
systems. 

Figure 6 shows a typical system implementation. 
Transparent latches are used to de-multiplex the R3081 
address and data busses from the A/D bus. The data paths 
between the memory system elements and the A/D bus is 
managed by simple octal devices. Asmall set of simple PALS 






R 


Address/ 
Data 


is used to control the various data path elements, and to 
control the handshake between the memory devices and the 
CPU. IDT has implemented the R3720/21 support chip set 
specifically tailored to R3051 family systems. This chip set 
directly interfaces the processor to DRAM, simplifying design 
and eliminating discrete logic chips and PAL devices. 

Depending onthe cost vs. performance tradeoffs appropriate 
to a given application, the system design engineer could 
include true burst support from the DRAM to provide for high- 
performance cache miss processing, or utilize a simpler, 
lower performance memory system to reduce cost and simplify 
the design. Similarly, the system designer could choose to 
implement techniques such as external secondary cache, or 
DMA, to further improve system performance. 
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Figure 6. R3081 RISChipset Based System 
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DEVELOPMENT SUPPORT | | | 
~The IDT R3051 family is supported by a rich set of 
development tools, ranging from system. simulation tools 
through PROM monitor. and debug support, applications 
software and utility libraries, logic analysis tools, sub-system 
modules, and shrink wrap operating systems. The R3081, 
which is pin and software compatible with the R3051, can 
directly utilize these existing tools to reduce time to market. 
Figure 7 is an overview of the system development process 
typically used when developing R3051 family applications. 
The R3051 family is supported in all phases of project 
development. These tools allow timely, parallel development 
of hardware and software for R3051 family applications, and 
include tools such as: 
¢ A program, Cache-R3051, which allows the Serninance 
of an R3051 family system to be modeled and understood 
without requiring actual hardware. 
* Sable, an instruction set simulator. 
¢ Optimizing compilers from MIPS, the acknowledged leader 
in optimizing compiler technology. 


* Cross ss lsemiect tools, available ina variety .of 


development environments. 
> The high- performance IDT floating point library software, 
including transcendental functions and IEEE compliant 
exception handlers. 
* The IDT Evaluation Board, which includes RAM, EPROM, 
/O, and the IDT PROM Monitor. 
* The IDT Laser Printer System board, which directly drives 
_a low-cost print engine, and runs Microsoft Truelmage™ 
Page Description Language on top. of PeerlessPage™ 
Advanced Printer Controller BIOS. 
¢ Adobe PostScript™ Page Description Language, ported to 
the R3000 instruction set, runs on the IDT R3051 family. 
* IDT/sim, which implements a full prom monitor (diagnostics, 
remote debug support, peek/poke, etc.). 
- IDT/sae, whichimplements a run-time support package for 
R3051 family systems. 
¢ ACE UNIX operating system; bringing ACE compatibility. 


System System System 
Architecture Development Integration 
Evaluation and Verfification 


Phase 


SABLE Simulator 
DBG Debugger 
‘PIXIE Profiler 
MIPS Compiler Suite 
Stand-Alone Libraries 
Floating Point Library 
Cross Development Tools |\ 


Adobe PostScript PDL 
MicroSoft Truelmage PDL }» 
PeerlessPage PrinterOS |: 


Cache-3051 
SPP 
Benchmarks | 
Evaluation Board 
Laser Printer System 
X-Terminal System 


Figure 7. R3051 Family Development Toolchain 


Cache-3051 
Hardware Models 
General CAD Tools 
Evaluation Board 
Laser Printer System 
X-Terminal System 
Support Chips 





ACE UNIX 
Logic Analysis — 
Diagnostics 
IDT/sim 
IDT/sae 
In-Circuit Emulation 
Remote Debug 
Real-Time OS 
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PERFORMANCE OVERVIEW 


The R3081 achieves a very high-level of performance. This 
performance is based on: 
An efficient execution engine. The CPU performs ALU 
operations and store operations in a single cycle, and has 
an effective load time of 1.3 cycles, and branch execution 
rate of 1.5 cycles (based on the ability of the compilers to 
avoid software interlocks). Thus, the execution engine 
achieves over 35 VUPS performance when operating out 
of cache. 
A full featured floating point accelerator/co-processor. The 
R3081 incorporates an R3010A compatible floating point 
accelerator on-chip, with independent ALUs for floating 
point add, multiply, and divide. The floating point unit is fully 
hardware interlocked, and features overlapped operation 
and precise exceptions. The FPA allows floating point 
adds, multiplies, and divides to occur concurrently with 
each other, as well as concurrently with integer operations. 
Large on-chip caches. The R3051 family contains caches 
which are substantially larger than those on the majority of 
today’s microprocessors. These large caches minimize the 
number of bus transactions required, and allow the R3051 
family to achieve actual sustained performance very close 
to its peak execution rate. The R3081 doubles the cache 
available on the R3052, making it a suitable enging for 
many general purpose computing applications, such as 
ARC compliant systems. 
Autonomous multiply and divide operations. The R3051 
family features an on-chip integer multiplier/divide unit 
which is separate from the other ALU. This allows the CPU 
to perform multiply or divide operations in parallel with other 
integer operations, using a single multiply or divide 
instruction rather than “step” operations. 
Integrated write buffer. The R3081 features a four deep 
write buffer, which captures store target addresses and 
data at the processor execution rate and retires it to main 
memory atthe slower main memory access rate. Use of on- 
chip write buffers eliminates the need for the processor to 
stall when performing store operations. 
Burst read support. The R3051 family enables the system 
designer to utilize page mode or nibble mode RAMs when 
performing read operations to minimize the main memory 
read penalty and increase the effective cache hit rates. 
These techniques combine to allow the processor to achieve 
over 35 VUPS integer performance, 11MFlops of Linpack 
performance, and64,000 dhrystones without the use of external 
caches or zero wait-state memory devices. 

The performance differences between the various family 
members depends on the application software and the design 
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ofthe memory system. The impact of the various cache sizes, 
and the hardware floating point, can be accurately modeled 
using Cache-3051. Since the R3051, R3052, and R3081 are 
all pin and software compatible, the system designer has 
maximum freedom in trading between performance and cost. 
A system can be designed, and later the appropriate CPU 
inserted into the board, depending on the desired system 
performance. 


SELECTABLE FEATURES 


The R3081 allows the system designer to configure certain 
aspects of operation. Some of these options are established 
when the device is reset, while others are enabled via the 
Config registers: 

BigEndian vs. LittleEndian Byte Ordering. The part can be 
configured to operate with either byte ordering. ACE/ARC 
systems typically use Little Endian byteordering. However, 
various embedded applications, written originally for a Big 
Endian processor such as the MC680x0, are easier to port 
to a Big Endian system. 

Data Cache Refill of one or four words. The memory 
system must be capable of performing four word refills of 
instruction cache misses. The R3081 allows the system 
designer to enable D-Cache refill of one or four words 
dynamically. Thus, specialized algorithms can choose one 
refill size, while the rest of the system can operate with the 
other. 

Half-frequency bus mode. The processor canbe configured 
such that the external bus interface is at one-half the 
frequency of the processor core. This simplifies system 
design; however, the large on-chip caches mitigate the 
performance impact of using a slower system bus clock. 
Slow bus turn-around. The R3081 allows the system 
designer to space processor operations, so that more time 
is allowed for transitions between memory andthe processor 
on the multiplexed address/data bus. 

Configurable cache. The R3081 allows the system designer 
to use software to select either a 16kB Instruction Cache/ 
4kB Data Cache organization, or an 8kB Instruction/8kB 
Data Cache organization. 

Cache Coherent Interface. The R3081 has an optional 
hardware based cache coherency interface intended to 
support multi-master systems such as those utilizing DMA 
between memory and |/O. 

Optional 1x or 2x clockinput. The R3081 canbe driven with 
an R3051 compatible 2x clock input, or a lower frequency 
1x clock input. 
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THERMAL CONSIDERATIONS 

The R3081 utilizes special packaging techniques to improve 
the thermal properties of high-speed processors. Thus, the 
R3081 is packaged using cavity down packaging, with an 
embedded thermal slug to improve thermal transfer to the 
Suurrounding air. , 

The R3081 utilizes the 84-pin MQUAD package (the "Mu" 
package), which is an all aluminum package with the die 
attached to a normal copper lead-frame mounted to the 
aluminum casing. The MQUAD package allows for an efficient 
thermal transfer between the die and the case due to the heat 
spreading effect of the aluminum. The aluminum offers less 
internal resistance from one end of the package to the other, 
reducing the temperature gradient across the package and 
therefore presenting a greater area for convection and 
conduction to the PCB for a given temperature. Even nominal 
amounts of airflow will dramatically reduce the junction 
temperature of the die, resulting in cooler operation. The 
MQUAD packages available atall frequencies, and is pin and 
form compatible with the PLCC used for the R3051. Thus, 
designers can inter-change R3081's and R3051'sina particular 
design, without changing their PC Board. 










| OCA 

itiow (Rimin) |_© | 200 | 400 | 600 | 00 | 1000__ 

"Mu" Package | 22 | 14 [12 [11 [9 [es 

[PaAPackage | 2 [1s fe [7 |e [5s 
2889 tb! 01 


Table 1. Thermal Resistance (@ca) at Various Airflows 
(*estimated: final values tbd) 







Finally, the R3081 is also available in a cavity-down PGA 
with exposed thermal slug. As with the MQUAD package, this 
package is exiremely thermal efficient, and is appropriate in 
extreme temperature conditions, such as military systems. 

The R3081 is guaranteed in a case temperature range of 
0°C to +85°C. The type of package, speed (power) of the 
device, and airflow conditions, affect the equivalent ambient 
temperature conditions which will meet this specification. 

The equivalent allowable ambient temperature, Ta, canbe 
calculated using the thermal resistance from case to ambient 
(Oca) of the given package. The following equation relates 
ambient and case temperatures: 

Ta = Tc-P* Goa 

where P is the maximum power consumption at hot 
temperature, calculated by using the maximum Icc specification 
for the device. | 7 

Typical values for Oca at various airflows are shown in 
Table 1. 

Note that the R3081 allows the operational frequency to be 
turned down during idle periods to reduce power consumption. 
This operation is described in the R3081 Hardware User's 
Guide. Reducing the operation frequency dramatically reduces 
power consumption. 
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PIN DESCRIPTION 


Address/Data: A 32-bit time multiplexed bus which indicates the desired address for a bus transaction 
in one phase, and whichis used to transmit data between the CPU and external memory resources during 
the rest of the transfer. 


Bus transactions ‘on this bus are logically separated into two phases: during the first phase, information 
about the transfer is presented to the memory system to be captured using the ALE output. This 
information consists of: 


Address(31:4): The high-order address for the transfer is presented on A/D(31:4). 


BE(3:0): These strobes indicate which bytes of the 32-bit bus will be involved in 
the transfer, and are presented on A/D(3:0). 


During write cycles, the bus contains the data to be stored and is driven from the internal write buffer. 
On read cycles, the bus receives the data from the external resource, in either a single data 
transaction or in a burst of four words, and places it into the on-chip read buffer. 


During cache coherency operations, the R3081 monitors the A/D bus at the start of a DMA write to capture 
the write target address for potential data cache invalidates. 





Addr(3:2) Low Address (3:2) A 2-bit bus which indicates which word is currently expected by the processor. 
Specifically, this two bit bus presents either the address bits for the single word to be transferred (writes 


or single datum reads) or functions as a two bit counter starting at ‘00’ for burst read operations. 


During cache coherency operations, the R3081 monitors the Addr bus at the start of a DMA write to 
capture the write target address for potential data cache invalidates. 








Diagnostic Pin 1. This output indicates whether the current bus read transaction is due to an on- 
chip cache miss, and also presents part of the miss address. The value output on this pin is time 
multiplexed: 







Cached: During the phase in which the A/D bus presents address information, this 
| . pin is an active high output which indicates whether the current read is 
a result of a cache miss. 










During the remainder of the read operation, this output presents 
address bit (3) of the address the processor was attempting to 
reference when the cache miss occurred. Regardless of whether a 
cache miss is being processed, this pin reports the transfer address 
during this time. 


Miss Address (3): 








On write cycles, this output signals whether the data being written as retained in the on-chip data cache. 
The value of this pin is time multiplexed during writes: 






Cached: During the address phase of write transactions, this signal is an active 
| < high output which indicates that the store data was retainedin the on-chip 
data cache. 3 







Reserved: The value of this pin during the data phase of writes is reserved. 





Diagnostic Pin 0. This output distinguishes cache misses due to instruction references from those 
due to data references, and presents the remaining bit of the miss address. The value output on this 
pin is also time multiplexed: 


VD: - If the “Cached” Pin indicates a cache miss, then a high on this pin at this 
time indicates an instruction reference, and a low indicates a data 
reference. Ifthe read is not due to a cache miss but rather an uncached 
reference, then this pin is undefined during this phase. 


Miss Address (2): During the remainder of the read operation, this output presents 
; address bit (2) of the address the processor was attempting to 
reference when the cache miss occurred. Regardless of whether a 
cache miss is being processed, this pin reports the transfer address 
during this time. — 





During write cycles, the value of this pin during both the address and data phases is reserved. 
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PIN DESCRIPTION (Continued): 


ALE /O 


Address Latch Enable: Used to indicate that the A/D bus contains valid address information for 
the bus transaction. This signal is used by external al logic to capture the address for the transfer, typically 
using transparent latches. 


During cache coherency operations, the R3081 monitors ALE at the start of aDMA wae to capture the 
write target address for potential data cache invalidates. 


Pee Read: An output which indicates that the current bus transaction is a read. 


Write: An output which indicates that the current bus transaction is a write. 










ort 






~ During coherent DMA, this input indicates that the current transfer is a write. 


External Data Enable: This signal indicates that the A/D bus is no longer being driven by the processor 
during read cycles, and thus the external memory system may enable the drivers of the memory 
system onto this bus without having a bus conflict occur. During write cycles, or when no bus 
transaction is occurring, this signal is negated, thus disabling the external memory drivers 






Burst Transfer/Write Near: On read transactions, the Burst signal indicates that the current bus read 
is requesting a block of four contiguous words from memory. This signal is asserted only in read cycles 
due to cache misses; it is asserted for all |-Cache miss read cycles, and for D-Cache miss read cycles 
if quad word refill is currently selected. | 


On write transactions, the WrNear output tells the external memory system that the bus interface unit 
is performing back-to-back write transactions to an address within the same 512 word page as the prior 
write transaction. This signal is useful in memory systems which empIey page mode or static column 
DRAMs, and allows near writes to be retired quickly. 





Acknowledge: An input which indicates to the device that the memory system has sufficiently 
processed the bus transaction, and that the CPU may either terminate the write cycle or 
_ process the read data from this read transfer. 







During Coherent DMA, this input indicates that the current write transfer is completed, and that the 
internal invalidation address counter should be incremented. 











Read Buffer Clock Enable: An input which indicates to the device that the memory system has 
_placed valid data on the A/D bus, and that the processor may move the data into the on-chip Read 
Buffer. : 


System Reference Clock: An output from the CPU which reflects the timing of the internal 
processor "Sys" clock. This clock is used to control state transitions in the read buffer, write buffer, 

memory controller, and bus interface unit. This clock will either be at the same frequency as the CPU 
execution rate clock, or at one-half that frequency, as selected during reset. 

























DMA Arbiter Bus Request: An input to the device which requests that the CPU tri-state its bus 
. interface signals so that they may be driven by an external master. 








DMA Arbiter Bus Grant. An output from the CPU used to acknowledge that a BusFeq has been 
detected, and that the bus is relinquished to the external master. 





invalidate Request. An input provided by an external DMA controller to request that the CPU invalidate 
the Data Cache line corresponding to the current DMA write lege address. This signal is the same pin 
as Diag(0) 


Coherent DMA Request. An input used by the external DMA controller to indicate that the requested 









DMA operations could involve hardware cache coherency. This signal is the ‘Rsvd(0) of the R3051. 






- Branch Condition Port: These external signals are internally connected to the CPU signals 
CpCond(3:0). These signals can be used by the branch on co-processor condition instructions as input 
ports. There are two types of Branch Condition inputs: the SBrCond inputs have special internal 

logic to synchronize the inputs, and thus may be driven by asynchronous agents. The direct Branch 
Condition inputs must be driven synchronously. Note that BrCond(1) is used by the internal FPA, and 
thus is not available on an external pin. 


SBrCond(3:2) 
BrCond(0) 













BusError 


Bus Error: Input to the bus interface unit to terminate a bus transaction due to an external bus error. 
This signal is only sampled during read and write operations. If the bus transaction is a read operation, 
then the CPU will take a bus error exception. . os. oe — | 





77) 
~~ 
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PIN DESCRIPTION (Continued): 


Processor Interrupt: During normal operation, these signals are logically the same as the Int(5:0) 
signals of the R3000. During processor reset, these signals perform mode initialization of the CPU, but 
in a different (simpler) fashion than the interrupt signals of the R3000. 






There are two types of interrupt inputs: the Sint inputs are internally synchronized by the processor, 
and may be driven by an asynchronous external agent. The direct interrupt inputs are not internally 
synchronized, and thus must be externally synchronized to the CPU. The direct interrupt inputs have 
one cycle lower latency than the synchronized interrupts. Note that the interrupt used by the on-chip 
FPA will not be monitored externally. 


Clkin Master Clock Input: This input clock can be provided at the execution frequency of the CPU (1x clock 
mode) or at twice that frequency (2x clock mode), as selected at reset.. 

peel 5 

Revd(4:1) 






















Master Processor Reset: This signal initializes the CPU. Mode selection is performed during 
the last cycle of Reset. 







Reserved: These four signal pins are reserved for testing and for future revisions of this device. 
Users must not connect these pins. Note that Rsvd(0) of the R3051 is now used for the CohReqg input 


pin. 
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ABSOLUTE MAXIMUM aanneee 3) 
[Sumbett_—_Rating _ Commercil|_tary Unit 


Terminal Voltage - 
with Respect 
to GND - 


Operating Case a to #89 —55 to +125 
Temperature 


Temperature 









0.5 to +7.0 








Ce 








i ie —55 to Porc poe to ee 





Under Bias 


) laa Storage ind, 10 4125 aot to ave ak | 
Temperature |. 
ca Input Voltage . |-0.5 to +7. 


NOTES: 2889 tbl 05 

1. Stresses greater than those listed under ABSOLUTE MAXIMUM RATINGS 
may cause permanent damage to the device. This is a stress rating only 
and functional operation of the device at these or any other conditions - 
above those indicated in the operational sections of this specifications not - 
implied. Exposure to absolute maximum rating conditions for extended 
periods may affect reliability. 

2. ViINminimum = —3.0V for pulse width less than 15ns. 

' Vin should not exceed Vcc +0.5V. 

3. Notmore than one outputshould be shorted ata time. Duration of the short 
should not exceed 30 seconds. 





AC TEST CONDITIONS—3081 


Cv | input cs Votago | 8 
[Preston | 
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Symbol |_Paremeter_{_Mip._ {Moe 
ie ~ 
| Input Low vottage “| — | 0 
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Commercial | 
_| Commercial. | 


Of -o5te+70] V | 


RECOMMENDED OPERATING = 
TEMPERATURE AND SUPPLY dco 












bcs nerature 
[eC 2125 










Military - 


3.345% 









“O°C to 785°C 









0°C to +85°C 
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OUTPUT LOADING FOR AC TESTING 
nee To Device 
+1.5V Under Test 
ah CLD 
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| Signal | cu 
SysClk 50 pf 
All Others 25 pf 
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DC ELECTRICAL CHARACTERISTICS 3081— (Tc 


| VOH Output HIGH Voltage Vcc = Min., lOH = -4mA 


VOH 
VOL Output LOW Voltage Vcc =Min., lol = 4mA 
VIH ) 
Input LOW Voltage(’) 
2 

IN 
loc 
HH 
HL 
loz 







Output Capacitance) 


Flin | Input HIGH Leakage ViH = VCC 
Pie | Input LOW Leakage Vit = GND 
files Output Tri-state Leakage | VOH=2.4V, VOL=0.5V 


NOTES: 

1. Vit Min. = —3.0V for pulse width less than 15ns. Vit should not fall below -O 
2. VIHs and Vits apply to Clkin and Reset. 

3. ViH should not be held above Vcc + 0.5V. 

4. Guaranteed by design. 









DC ELECTRICAL CHARACTERISTICS 3081L— (T 


Symbol Test Conditions 


Vcc = Min., lOH = -4mA 

Vcc = Min., lo. = 4mA 
| Input Capacitancel®) 

| 

| 


Output Capacitance(4) 


Operating Current Voc =3.3V, TA= 25°C 


CoutT 


[cour | 


OH 
OL 
/iH 
IL 
iN 
cc 
IH 
IL 
OZ 


i [put LOW Leakage [Vaz GND 
[ioe | ouput Teste Leakage [Vow=24¥, Vor=osv 
NOTES: 


1. Vit Min. =—3.0V for pulse width less than 15ns. Vit should not fall below -O 
VIHs and VILs apply to Clkln and Reset. 

ViH should not be held above Vcc + 0.5V. 

Guaranteed by design. 


2. 
3. 
4. 
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= 0°C to +85°C, Voc = +5.0V +5%) 





[zowre [_25une | sa.somne [sour 

as — as — | 
= 0s 
20 = 
es ee 
= 04 
=o [= 
=e [= 0 [= 0 or 
r 


-100 100};-100 100 |}-100 100 {-100 100 


.5V for larger periods. 


= 
5 | 
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| 2.4 — 
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AC ELECTRICAL CHARACTERISTICS 3081 " 7)— (Tc = 0°C to +85°C, Voc = +5.0V +5%) 






7 . Description a 
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SysCik (full frequency mode) | Pulse Width(®) 
SysClk (full frequency mode) | Clock High Time’) 
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NOTES: 

1.. All timings referenced to 1.5V. 

2. The AC values listed here reference timing diagrams contained in the R3081 Family Hardware User's Manual. 

3. Guaranteed by design. . ar . . 

4. This parameter is used to derate the AC timings according to the loading of the system. This parameter provides a deration for loads over the specified 
test condition; that is, the deration factor is applied for each 25pF over the specified test load condition. . 

5. In 1x clock mode, t22 is replaced by t44/2. 


5.6 15 


IDT79R3081 RiSController COMMERCIAL TEMPERATURE RANGE 





AC ELECTRICAL CHARACTERISTICS 3081 (continued) 2)— (Tc = 0°C to +85°C, Voc = +5.0V +5%) 


Symbol | | eee. 


Min. Max. |Min. Max. tin. Max. | in. Max. 
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NOTES: 

1. All timings referenced to 1.5V. 

2. The AC values listed here reference timing diagrams contained in the A3081 Family Hardware User's Manual. 

3. Guaranteed by design. 

4. This parameter is used to derate the AC timings according to the loading of the system. This parameter provides a deration for loads over the specified 
test condition; that is, the deration factor is applied for each 25pF over the specified test load condition. 

5. In 1x clock mode, t22 is replaced by t44/2. 
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AC ELECTRICAL CHARACTERISTICS 3081L  2)— (Tc = 0°C to +85°C, Voc = +3.3V45%) 


Symbol ir eee oe 
BusReq, Ack, BusError, _| Set-up to SysCik rising | 3 
RdCEn, CohReq . 
, | Hold from SysClk rising 


























: 
| 
| 








-—- 
—_ 


) 
” 









+ 
NO 
3 








) uo 
| | 


=~); ODEN]. 
Py oF qld. 


a2 
Nh 
se) 


-~ ot 
ine) 


Set-up to SysClk failing 
Hold from SysCIk falling 





| 

— 
fo) 
> 
Q 


rad 








-_ 
BS 









Asserted from SysClk rising 





Negated from SysClk falling 


ot 
0) 





o 
J 
3 
fe) 








3 


7] 


Valid from SysClk rising 
Asserted from SysClk rising 
ALE Negated from SysClk falling 





” 


a 
Oo 
> 
i 
Oo 


5S 
” 








Hold from ALE negated 
Asserted from SysClk falling 


Oo 
os 
ay 
m 
3 


t11 





ns 


Wr Tri-state from SysClk rising 







Asserted from A/D tri-state!) 





_ | Driven from SysClk rising(?) 


rN 
O 
| 
© 
a 
a) 














Wr, Rd, DataEn, Burst/WrNear | Negated from SysCik falling 





Driven from SysClik falling 


Addr(3:2) Valid from SysCik 


t16 





vt Cn a 
a —_ 


Valid from SysClik 





oe 
s) 


18 

9g 
t20 
t21 


t22 Clkin (2x clock mode) Clock Period 
t23 Reset Pulse Width from Vcc valid 


t24 
t25 
26 
t27 
28 


Tri-state from SysClk falling 





SysClk falling to data out 


ue 
= 
& 
s) 
—_ —_ 
PEE REE ERECPTETELELE TEE: 






Clkin (2x clock mode 


Pulse Width High 







of | ele ha ° 
| a nfl jefe la 


) 
(2x clock mode) Pulse Width Low 


Q 
cal 
=) 


15 250 12.5 250 ns 
200 ak 
32 





Minimum Pulse Width 





Set-up to SysCik falling 





7 a 
2 ee: 
_ 10 
2 Sere? 
= 15 
_— 7 
a2 10 
25 —-250 
2000 — 
32 = 
10 an 


Mode set-up to Reset rising —| 
Mode hold from Reset rising | 
Int, SBrCond Set-up to SysClk falling 6 


ta af 


> 
” 





3 


Ss 


~ 





ot 
NO 
wo 
92) 


3| 





Int, SBrCond Hold from SysClik falling 
Int, BrCond Set-up to SysClk falling me 5 | 


30 
31 Hold from SysCik falling 

tsys SysClk (full frequency mode) | Pulse Width!) 2*t22 2*t82 | a*t22 2*te2 | 2*t22 2*t22 | 2*t22 ©2*t22 
132 | 22-2 12242 | t22-2 12242 | 122-1 12241] t22-1 12241 


SysClk (full frequency mode) | Clock High Time(®) 


t 


200 — 
32 — 
3 eae 
7 a 
2 de 











a : 





2889 tol 10 
NOTES: 
1. All timings referenced to 1.5V. 
2. The AC values listed here reference timing diagrams contained in the R3081 Family Hardware User's Manual. 
3. Guaranteed by design. 
4. This parameter is used to derate the AC timings according to the loading of the system. This parameter provides a deration for loads over the specified 
test condition; that is, the deration factor is applied for each 25pF over the specified test load condition. 
5. In 1x clock mode, t22 is replaced by t44/2. 
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AC ELECTRICAL CHARACTERISTICS 3081L (continued) 2)— (Tc = 0°C to +85°C, Vcc = +3.3V +5%) 
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NOTES: 

1. All timings referenced to 1.5V. 

2. The AC values listed here reference timing diagrams contained in the R3081 Family Hardware User's Manual. 

3. Guaranteed by design. 

4. This parameter is used to derate the AC timings according to the loading of the system. This parameter provides a deration for loads over the specified 
test condition; that is, the deration factor is applied for each 25pF over the specified test load condition. 

5. In 1x clock mode, t22 is replaced by t44/2. 


3 
t3 
t36 
3 
t39 

t42 

t43 





2889 tbl 11 


5.6 18 





IDT79R3081 RiSController . COMMERCIAL TEMPERATURE RANGE 





PIN CONFIGURATIONS 
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NOTE: 
Reserved Pins must not be connected. 
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PIN CONFIGURATIONS (CONTINUED) 





R3081 
84-Pin Ceramic Pin Grid Array 


(Cavity Down) 


Bottom View 





2889 drw 10 
84-Pin PGA with Integral Thermal Slug 


BottomView 


NOTE 
Reserved Pins must not be connected. 
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Figure 8 (d). R3081 Clocking (2x clock input mode, full-frequency bus) 
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Figure 9. Power-On Reset Sequence 
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Figure 11. Mode Selection and Negation of Reset 
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Figure 12. Single Datum Read in R3081 
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Figure 13. R3081 Burst Read 
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Figure 14 (b). End of Throttled Quad Read 
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Figure 15, R3081 Write Cycle 2889 drw 22 
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Figure 16. Request and Relinquish of R3081 Bus to External Master 2889 drw 23 
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Figure 17. R3081 Regaining Bus Mastership 
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Figure 18. Synchronized Interrupt Input Timing 
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Figure 19. Direct Interrupt Input Timing 
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Figure 20. Synchronized Branch Condition Input Timing 
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Figure 21. Direct Branch Condition Input Timing 
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- Figure 22. Coherent DMA Request 
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Figure 23. Beginning of Coherent DMA Write 
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Figure 24. Cache Word Invalidation 
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Figure 25. End of Coherent Write 
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Figure 26. End of Coherent DMA Request 
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84-PIN PGA (CAVITY DOWN) 
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NOTES: 

All dimensions are in inches, unless otherwise noted. 
BSC—Basic lead Spacing between Centers 

Symbol “M” represents the PGA matrix size. 

Symbol “N” represents the number of pins. 
Chamfered corners are IDT’s option. 

Shaded area indicates integral metallic heat sink. 
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NOTES: 

. All dimensions are in inches, unless otherwise noted. 

BSC—Basic lead Spacing between Centers. 

D & E do not include mold flash or protutions. 

Formed leads shall be planar with respect to one another and within .004” at the seating Pian? 
. ND & NE represent the number of leads in the D & E directions respectively. 

D1 & Ei should be measured from the bottom of the package. 

. 84-pin MQUAD is pin & form compatible with 84-pin PLCC of R3051/2 
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ORDERING INFORMATION 
XXXXX — XX X X 
I0T ———_——- 
Device Type Speed Package Process/ 
emp. Range 

Blank Commercial Temperature Range 
'B' Compliant to MIL-STD-883, Class B 
'M' Military Temperature Range Only 
'MJ' 84-Pin MQUAD 
'G' 84-Pin PGA with Integral Thermal Slug 
‘20' 20.0 MHz 
'25' 25.0 MHz 
‘'33' 33.33 MHz 
‘40' 40.0 MHz 
79R3081 No TLB; Vcc = 5V 
79R3081E With TLB; Vcc = 5V 
79R3081L No TLB; Vcc = 3.3V 
79R3081L With TLB; Vcc = 3.3V 2889 drw 36 

VALID COMBINATIONS 

IDT 79R3081 — 20, 25, 33, 40 All packages 


79R3081E — 20, 25, 33, 40 All packages 
79R3081L — 20, 25, 33, 40 All packages 
79R3081LE — 20, 25, 33,40 ~— All packages 


79R3081M, B— 20, 25 PGA Package Only 
79R3081EM, B — 20, 25 PGA Package Only 
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THIRD GENERATION IDT79R4000, 


64-BIT SUPER-PIPELINED PRELIMINARY 


RISC MICROPROCESSOR 


Integrated Device Technology, Inc. 





FEATURES: — 16KB instruction; 16KB data cache (R4400) 
















* True 64-bit microprocessor — Flexible MMU with large TLB 
— 64-bit integer operations . Standard operating system support includes: 
— 64-bit floating-point operations — Microsoft Windows NT 
— 64-bit registers — UNISOFT UNIX System V.4 
— 64-bit virtual address space . Fully software compatible with R3000A 32-bit RISC 
* High-performance microprocessor Processor Family | 
— Super-pipelined architecture supports 150MIPS * 50, 67 and 75MHz clock frequencies ~ 
at 75MHz * 64GB physical address space» _— 
— No issue restrictions for dual instruction issue * Processor family for a wide variety of applications 
— Over 80 VUPs performance at 75MHz clock — Desktop workstations 
frequency — Deskside or departmental servers 
- High level of integration — High-performance embedded applications 
— 64-bit integer CPU — Tightly coupled multi-processing systems 
— 64-bit floating-point accelerator — Fault tolerant Syrians 
— 8KB instruction cache; 8KB data cache (R4000) 
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DESCRIPTION: 
- The IDT79R4000 family supports a wide Variety of proces- 


sor based applications, from 32-bit ARC compliant desktop 


systems through high-performance, 64-bit OLTP systems 
manipulating large data bases in a multi-processor based 
system. The IDT79R4000/R4400 family offers a broad range 
of price-performance options for high-performance systems, 
allowing the system architect unprecedented degrees of free- 
dom in making price-performance tradeoffs:. 

The IDT R4000 family provides complete upward applica- 
tion-software compatibility with the IDT79R3000 family of 
microprocessors, including the IDT79R3000A and the IDT 
RISController™ family (the IDT79R3001 and IDT79R3051 
family). Microsoft Windows NT and UNISOFT UNIX V.4 
operating systems insure the. availability of thousands of 
applications programs, geared to provide a complete solution 
to a large number of processing needs. An array of develop- 
ment tools facilitates the rapid development of R4000-based 
systems, enabling a wide variety of customers to take advan- 
tage of the MIPS Open Architecture philosophy. 

The R4000 family achieves a unique balance between 
high-integer and high-floating-point performance. The key to 
this balance is the super-pipelined architecture of the proces- 
sor, which brings performance. gains to both integer and 
floating-point intensive. programs without requiring re-com- 
pilation to take advantage of the architectural advancement. 


"The super-pipeline architecture is well-balanced in the R4000 


E family; the high-performance execution engine is assured of 


- a rapid and continual supply of instructions and data through 


the use of large on-chip caches, and a high-performance on- 
chip TLB. The result is consistently high-performance: over 80 
VUPS at 75MHz over a wide variety of realistic applications 
programs. 

The R4000 family also provides a compatible, timely, and 
necessary evolution path from 32-bit to true, 64-bit computing. 
The original design objectives of the R4000 clearly mandated 
this evolution path; the result is a true 64-bit processor fully 
compatible with 32-bit operating systems and applications. 

The 64-bit computing and addressing capability of the 
R4000 enables a wide variety of capabilities previously limited 
by a smaller address space. For example, the large address 
space allows operating systems with extensive file mapping; 
direct access to large files can occur without explicit I/O calls. 
Applications such as large CAD databases, multi-media, and 
high-quality image storage and retrieval all directly benefit 
from the enlarged address space. 

This data sheet provides an overview of the features and 
architecture of the IDT79R4000 CPU family. A more detailed 
description of the processor is available in the “A4000 User's 
Manual”, available from IDT. Further information on develop- 


| ment sispon applications notes, and maine erenaly prod- 
-- ucts are also available from your local IDT sales representative. 


_ This data sheet describes both the R4000 and the R4400. 
The primary difference between the two devices is the amount 
of on-chip primary cache: the R4000 contains 8kB each of 
primary instruction and data cache, while the R4400 doubles 
this to 16kB each of cache. Throughout this data sheet, the 
term "R4000" willbe used to describe characteristics common 
to both the R4000 and the R4400 devices. 


IDT79R4000 FAMILY MEMBERS | 

The IDT79R4000 processor is available in three different 
configurations: the IDT79R4000MC and IDT79R4000SC, 
which include a 128-bit wide secondary cache bus; and the 
IDT79R4000PC, with no secondary cache interface. Addi- 
tionally, the R4000PC and R4000SC are available with two 
different on-chip cache configurations: the R4000, with 8KB 
each of instruction and data cache; and the R4400, which 
doubles on-chip cache to 16KB each of instruction and cache. 


PC CONFIGURATION 

The IDT79R4000PC and 79R4400PC are available in a 
179-pin Pin Grid Array (PGA). This configuration does not 
support secondary cache or cache coherency, and is ideal for 
applications such as high-performance embedded control 
and low-cost desktop systems, where the on-chip caches 
provide enough performance and where cost, power, and 
board space must be kept to a minimum. By eliminating a 
secondary cache, a system can be designed with fewer parts 
and lower power consumption. 


SC CONFIGURATION 


The IDT79R4000SC and 79R4400SC are available in a 
447-pin Pin Grid Array (PGA). This processor supports a 
secondary cache interface and is ideal insystems where high 
performance Is desired. This component supports a 128kB to 
4mB secondary cache made from standard SRAMs. This 
flexibility allows system designers to make price/performance 
tradeoffs in cache subsystem Beslans: 


MC CONFIGURATION 

_The IDT79R4400MC is also available in the 447-pin Pin 
Grid Array (PGA). This processor supports asecondary cache 
and configurable multiprocessor cache coherency protocols. 
Like the "SC" configuration, this processor also supports a 
128kB to 4mB secondary cache made from standard SRAMs. 
The IDT79R4400MC is well suited for a range of designs from 
high performance desktop systems to fault tolerant multipro- 
cessor servers. 
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Figure 1. R4000 Family 8-Stage Super-Pipeline 
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Instruction cache access stage 1 
Instruction cache access stage 2 
Instruction address translation stage 1 
Instruction address translation stage 2 
Instruction tag check 

Instruction decode 

Register operand fetch 

Operation 

Data virtual address calculation 

Data cache access stage 1 

Data cache access stage 2 

Data load or store align 

Data/Instruction address translation stage 1 
Data/Instruction address translation stage 2 
Data tag check 

Instruction virtual address calculation 
Write-back to register file 





Figure 2. Pipeline Activities 
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HARDWARE OVERVIEW 


The IDT R4000 family brings a high-level of niearation | 


designed for high-performance computing. The key elements 
of the IDT R4000 are briefly described below. A more detailed 
description of each of these subsystems is available in other 
literature. 


Superpipelined Implementation - 

In order to achieve the high-performance desired for today’s 
applications and user's interfaces, the R4000 exploits instruc- 
tion level parallelism using a super-pipelined micro- 
architecture. | 

The R4000 uses an 8-stage superpipeline which places no 
issue restrictions on instruction issue. Thus, any two instruc- 
tions can be issued each master clock cycle under normal 
circumstances, leading to 150MIPS performance at 75MHz. 
One key advantage of this architecture is that all existing 
applications can gain from the architectural advancement 
represented by the R4000, without requiring re- compilation to 
re-order the software. 

In order to support dual instruction issue, the internal 
pipeline of the R4000 operates at twice the external clock 
frequency. Instruction execution stages such as cache ac- 
cesses are pipelined (thus the chip itself is super-pipelined) 
to eliminate bottlenecks associated with long-latency func- 
tional units. Other stages, such as the ALU stage, completely 
process one operation per pipeline clock cycle, allowing the 
results of one operation to be immediately used by the 


_ instruction which follows, with no pipeline interlocks. 


High clock frequency results from careful construction of 


' the various resources of the processor: pipelining cache 


accesses, shortening register access times, implementing 
virtually indexed primary caches, and allowing the latency of 
functional units to span multiple pipeline stages. 

After extensive simulation of many methods of exploiting 
instruction level parallelism, superpipelining was chosen be- 
cause it improves integer performance commensurate with 
floating-point performance. Thus, the R4000 provides perfor- 
mance benefits both to technical computing applications, and 
also to a wide variety of commercial applications as well. In 


General Purpose Registers 
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today’s sehaclegi. super- eas STRIAGT results in less a panisle! 
logic, faster cycle times, quicker design cycles, and lower 
cost. The pipeline of the IDT79R4000 is illustrated in Figure 1. 


THE R4000 PIPELINE 
~The R4000 processor has an eight-stage execution pipe- 
line. That is, each instruction takes eight Pclock (Pipeline 


‘clocks, at twice the frequency of the input clock) cycles to 


execute, but a new instruction is started on each Pclock cycle. 
Another way of viewing the process is that at any point in time, 
eight separate instructions are being executed at once. Figure 
1 shows the R4000 pipeline in both views: a horizontal slice 
shows the execution process of individual instructions, and a 
vertical slice shows the processing of eight instructions at 


once. 


Each box shown i inthe diagram corresponds to a part of the 
execution process. 

Figure 2 illustrates the activities occuring within each 
pipestage as a function of the instruction type. First, in the IF 
stage, an instruction address is selected by the program 
counter logic and the first half of the both the instruction cache 
fetch (IC1) and the instruction virtual to physical address 
translation (ITLB1) is performed. The instruction address 
translation is done through a two entry subset of the main or 
joint translation lookaside buffer (JTLB) called the ITLB. 
In the IS stage, the second half of both the instruction cache 
fetch (IC2) and instruction translation (ITLB2) is done. 

During the RF stage, three activities occur in parallel. The 
instruction decoder (IDEC) decodes the instruction and 
checks for interlock conditions. Meanwhile, the instruction tag 
check (ITC) is performed between the instruction cache tag 
and the page frame number (PFN) from the ITLB’s translation. 
In parallel with both of the above, the operands are fetched 
from the register file (RF). 

In the EX stage, if the instruction is a register-to-register 
operation, the arithmetic or logical operation is performed 
(ALU). If the instruction is a load/store, a data virtual address 
is calculated (DVA). If the instruction is a branch, a virtual 
branch target address is calculated (IVA). 

For load/stores, the DF stage is used to do the first half of 
both the data cache fetch (DC1) and the data virtual to 
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Figure 3. CPU Registers 
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physical address translation (JTLB1). Similarly, the DS stage 
does the second half of both the data fetch (DC2) and the data 
translation (JTLB2) as well as the load align or store align 
(LSA), as appropriate. If the instruction is a branch, the JTLB 
is used during DF and DS to translate the branch address and 
refill the ITLB if necessary. 

The TC stage is used to perform the tag check for load/ 
stores. During the WB stage the instruction result is written to 
the register file. 

Smooth pipeline flow is interrupted when cache accesses 
miss, data dependencies are detected, or when exceptions 
occur. Interruptions that are handled by hardware, such as 
cache misses, are referred to as interlocks, while those that 
are handled using software are exceptions. Collectively, the 
cases of all interlock and exception conditions are referred to 
as faults. 

Interlocks come in two varieties. Those interlocks which are 
resolved by simply stopping the pipeline are referred to as 
Stalls, while those which require part of the pipeline to ad- 
vance while holding up another part are s/ips. 

At each cycle, exception and interlock conditions are 
checked for all active instructions. The conditions can be 
referred back to particular instructions, as each exception or 
interlock condition corresponds to a particular pipeline stage. 

When an exception condition occurs, the relevant instruc- 
tion and all that follow it in the pipeline are cancelled. Accord- 
ingly, any stall conditions and any later exception conditions 
that are referenced to the same instruction are inhibited; there 


is no value in servicing stalls for a cancelled instruction. Anew 
instruction stream is begun, starting execution at a predefined 
exception vector. System control coprocessor registers are 
loaded with information that will identify the type of exception 
and any necessary auxiliary information, such as the virtual 
address at which translation exceptions occur. 

When a Stall condition is detected, all eight instructions, 
each in a different stage of the pipeline, are frozen at once. 
Often, the stall condition is only detected after parts of the 
pipeline have advanced using incorrect data; this occurrence 
is referred to as pipeline overrun. When in the stalled state, 
parts of the pipeline that are immune to overrun are frozen and 
the remainder is permitted to continue clocking. Just before 
resuming execution, the pipeline overrun is reversed by 
inserting corrected information into the pipeline. 

When aslip condition is detected, the pipeline stages which 
must advance in order to resolve the dependency continue to 
be retired while the dependent stages are held until the 
necessary data is available. 

Another class of interlocks exists which, since they origi- 
nate external to the processor, are not referenced to a particu- 
lar pipeline stage. These interlocks are referred to as external 
stalls and are unaffected by the occurrence of exceptions. 


integer Execution Engine 
The R4000 implements the extended MIPS Instruction Set 
architecture, and thus is fully upwards compatible with appli- 
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Figure 4. The R4000 CPO Registers 
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cations running on the earlier generation parts. The R4000 
includes additions to the instruction set, targeted at improving 
performance and capability while maintaining binary compat- 
ibility with earlier processors. The extensions result in better 
code density, greater multi-processing support, improved 
performance for commonly. used code sequences in operat- 
ing system kernels, and faster execution of. floating-point 
intensive applications. All resource dependencies are made 
transparent to the programmer, insuring transportability 
amongst implementations of the MIES instruction set archi- 
tecture. 

In addition to the instruction extensions detailed above, 
new instructions have been defined to take advantage of the 
64-bit architecture of the processor. When operating as a 32- 
bit processor, the R4000 will take an exception on these new 
instructions. . 

The MIPS integer unit implements a load/store architecture 
with single cycle ALU operations (logical, shift, add, sub) and 
autonomous multiply/divide unit. The programmer model for 
the R4000 includes the register set illustrated in Figure 3. The 
register resources include: 32 general purpose orthogonal 
integer registers, the HI/LO result registers for the intger 
multiply/divide unit, and the program counter. In addition, the 
on-chip floating-point co-processor adds 32 floating-point 
registers, and a floating-point control/status register. 


System Control Co-processor (CPO) 

The system control co-processor in the MIPS architecture 
is responsible for the virtual memory sub-system, the excep- 
' tion control system, and the diagnostics capability of the 
processor. In the MIPS architecture, the system control co- 


processor (and thus the kernel software) is implementation | 
dependent. The R4000 CPO is a superset extension of the. 


MMU found in the R38000A. 


The Memory management unit controls the virtual memory 


system page mapping. It consists of an instruction translation 
buffer (the ITLB), a Joint TLB (the JTLB), and co-processor 
registers used for the virtual memory mapping sub-system. 


System Control Co-Processor Registers | 
The R4000 incorporates all system control co-processor 
(CPO) registers on-chip. These registers provide the path 


through which the virtual memory system's page mapping is. 


examined and changed, the operating modes (kernel vs. user 


mode, interrupts enabled or disabled, cache features) con- __ 


trolled, and these registers contro! exception handling. In 
addition, the R4000 includes registers to implement a real- 
time cycle counting facility, to address reference traps for 
debugging, to aid in cache diagnostic testing, and to assist in 
data error detection and correction. 

Figure 4 illustrates the a Control Co- Processor. 


Virtual to Physical Address Mapping 
The R4000 provides three modes of virtual addressing: 
* user mode | 
¢ kernel mode 
* supervisor mode 


- This mechanism is available to aevcieii software to earvide 
a secure environment for user processes. Bits in a status 
register determine which virtual addressing mode is used. In 
the user mode, the R4000 proves a single, UAE virtual 
address space of 2GB. 

When operating in the kernel mode, four distinct virtual 
address. spaces, totalling 4GB, are simultaneously available 
and are differentiated by the high-order bits os the virtual 
address. 

The R4000 processors also support a supervisor rhode i in 
which the virtual address space is 2.5GB, divided into two 
regions based on the high-order bits of the virtual address. 
The three different modes of virtual addressing are shown in 
Figure 5. When the R4000 is configured asa 64- bit micropro- 
cessor, the virtual address space layout is a compatible 
extension of the 32-bit virtual address space layout. 


Joint TLB 

For fast virtual-to- -physical address decoding, the R4000 
uses a large, fully associative TLB which maps 96 Virtual 
pages to their corresponding physical addresses. The TLB is 
organized as 48 pairs of even-odd entries, and maps a virtual 
address and address space identifier into the large, 649B 
physical address space. 

Two mechanisms are provided to assist in controlling the 
amount.of mapped space, and the replacement characteris- 
tics of various memory regions. First, the page size can be 
configured, on a per-entry basis, to map a page size of 4KB to 
16MB (in multiples of 4). A CPO register is loaded with the page 
size of amapping, and that size is entered into the TLB when 


OxFFFFFFFF Kernel virtual address 
space (kseg3) 
0xE0000000 Mapped, 0.59B 
OxDFFFFFFF 
- Supervisor Virtual 
address space (kseg) 
0xC0000000 Mapped, 0.598 
OxBFFFFFFF 
Uncached kernel physical 
7 address space (kseg1) 
0xA0000000 Unmapped, 0.59B 
Ox9FFFFFFF | : 
Cached kernel physical 
address space (kseg0) 
ox80000000 | Unmapped, 0.59B 
Ox7FFFFFFF : 
space (kseg) 
Mapped, 29B 
0x00000000 
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Figure 5. Kernel Mode Virtual Addressing (32-bit mode) 
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a new entry is written. Thus, operating systems can treat 
various regions of memory distinctly from applications pro- 
grams and data files: for example, a typical frame buffer can 
be memory mapped using only one TLB entry. 

The second mechanism controls the replacement algo- 
rithm when a TLB miss occurs. The R4000 uses a Random 
Replacement algorithm to select a TLB entry to be written with 
a new mapping; however, the processor provides a mecha- 
nism whereby a system specific number of mappings can be 
locked into the TLB, and thus avoid being randomly replaced. 
This facilitates the design of real-time systems, by allowing 
deterministic access to critical software. 

The joint TLB also contains information to control the cache 
coherency protocol for each page. Specifically, each page has 
attribute bits to determine whether the coherency algorithm is: 


uncached, noncoherent, sharable, exclusive, or update. The 
use of these attributes, coupled with state information in the 
processor caches, enables a wide variety of multi-processing 
strategies to be easily implemented. 

Figure 6 shows the format of the TLB entry and registers 
used to control the TLB. | 


Instruction TLB 

The R4000 also incorporates a 2-entry instruction TLB. 
Each entry maps a 4KB page. The instruction TLB improves 
performance by allowing instruction address translation to 
occur in parallel with data address translation. When a miss 
occurs on an instruction address translation, the ITLB is filled 
from the JTLB. The operation of the ITLB is invisible to the 
user. 


PageMask Register 
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12. 
MASK Page comparison mask 
0 Reserved. Must be written as zero; returns Zero when read. 
EntryHi Register 
31 1312 87 0 
VPN2 he | ASID 
19 5 8 

VPN2 Virtual Page Number divided by two (maps to two pages) 

ASID Address Space ID field. An 8-bit field which lets multiple processes share the TLB 
while each process has a distinct mapping of otherwise identical virtual page 
numbers. This is the same ASID described at the beginning of this chapter. 

O Reserved. Must be written as zero; returns zero when read. 
EntryLoO & EntryLo1 
63 62 61 3837 35 3433 32 


PFN 
24 


EScUe 
1 1 


3 


PFN Page Frame Number. Upper bits of the physical address. 

C Specifies the cache algorithm to be used. 

D Dirty. If this bit is set, the page is marked as dirty and, therefore, writable. This bit is 
actually a write-protect bit that software can use to prevent alteration of data. 

V Valid. If this bit is set, it indicates that the TLB entry is valid; otherwise, a TLBL or 
TLBS Miss occurs. | 

G Global. If this bit is set in both LoO and Lo1, then ignore the ASID. 

0 Reserved. Must be written as zero; returns zero when read. 
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Figure 6. Fields of an R4000 TLB Entry 
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Register File : 

The R4000 has thirty-two rn purpose regstele These 
registers are used for scalar integer operations and address 
calculation. The register file consists of two read ports and one 
write port, and uses bypassing to enable the reading and 
writing of the same registeer twice per cycle as well as to 
minimize the operation latency in the pipeline. 


ALU 

The R4000 ALU consists of the integer adder and logic unit. 
The adder performs address calculations in addition to arith- 
metic operations, and the logic unit performs all shift opera- 
tions. Each of these units is highly optimized and can perform 
an operation in a single superpipeline cycle. 


Integer Multiplier/Divider 

The R4000 integer multplier and divider units perform 
signed and unsigned multiply and divide operations and 
execute instructions in parallel with the ALU. The results of the 
operation are placed in the MDH/ and MDLO registers. The 
values can then be transferred to the general purpose register 
file using the MFHI/MFLO instructions. The following table 
shows the number of processor internal cycles required 
between a 32-bit integer multiply or divide and a subsequent 
MFHI or MFLO operation, in order that no interlock or stall 
occurs. 


ee Word Double Word 
ee 
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FLOATING-POINT UNIT 


The R4000 incorporates an entire floating-point unit on 
chip, including a floating-point register file and execution unit. 
The floating-point unit forms a “seamless” interface with the 
integer unit, decoding and executing instructions in parallel 
with the integer unit. 






Floating-point Co-Processor 
The R4000 floating-point execution unit supports single 
and double precision arithmetic, as specified in the IEEE 


Standard 754. The execution unit is broken into separate. 


multiply, divide, and add/convert/square root units, which 
allow for overlapped operations. The multiplier is pipelined, 
allowing a new multiply to begin every 4 cycles. 

As in the 1IDT79R2010 and IDT79R3010, the R4000 main- 
tains fully precise floating-point exceptions while allowing 
both overlapped and pipelined operations. Precise excep- 
tions are extremely important in mission-critical environments, 
such as ADA, and highly desirable for debugging in any 
environment. 

The floating-point unit's operation set includes floating- 
point add, subtract, multiply, divide, square root, conversion 
between fixed-point and floating-point format, conversion 
among floating-point formats, and floating-point compare. 
Thes operations comply with the IEEE Standard 754. 


Floating-Point General Register File 


The following table gives the latencies of some of the 
et instructions in internal PiOc es =e! cycles. 


Single Double 
—— Precision | Precision | 
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ROUND 
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FLOAT 
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SWC1,SDC1 









2884 thi 02 


The floating-point register file is made up of sixteen 64-bit 
registers which can also be viewed as thirty-two 32-bit float- 
ing-point registers. The MIPS architecture supports a 
coprocessor load and store double so, when configured as 64- 
bit registers, the floating-point unit can take advantage of the 
64-bit wide data cache and issue a co-processor load or store 
a doubleword instruction in every cycle. 


_ Floating-Point Control Register File 


_ The floating-point control registers contain a register for 
determining configuration and revision information for the 
coprocessor and control and status information. These are 
primarily involved with diagnostic software, exception han- 
dling, state saving and restoring, and control of rounding 
modes. 


CACHE MEMORY 


Inorder to keep the R4000’s high-performance superpipeline 
full and operating efficiently, the R4000 incorporates on-chip 
instruction and data caches. Each cache has its own 64-bit 
data path that can be accessed twice a cycle, so the instruc- 
tion and data caches can be accessed in parallel with full 
pipelining. Combining this feature with a pipelined, single 
master clock cycle access of each cache, the cache sub- 
system provides the integer and floating-point units with an 
aggregate bandwidth of 2GB per second at a system clock 
frequency of 75MHz. 


Instruction Cache 
The IDT79R4000 incorporates a direct-mapped on-chip 
instruction cache. This virtually indexed, physically tagged 
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cache is 8KB in size and is protected with byte parity. The 
R4400 doubles the on-chip instruction cache to 16KB. 
Because the cache is virtually indexed, the virtual-to- 
physical address translation occurs in parallel with the cache 
access, thus further increasing performance by allowing these 
two operations to occur simultaneously. The tag holds a 24- 
bit physical address and valid bit, and is parity protected. 
The instruction cache is 64-bits wide, and can be refilled or 
accessed twice per master clock cycle, although the current 
IDT79R4000 CPU fetches on 32-bit unit/master cycle for a 
peak instruction bandwidth of 400MB/sec. The line size can 
be configured as four or eight words to allow different applica- 
tions to have a line size that delivers optimum performance. 


Data Cache 

For fast, single cycle data access, the IDT79R4000 includes 
an 8KB on-chip data cache. The R4400 doubles the on-chip 
instruction cache to 16KB. 

The data cache is protected with byte parity and its tag is 
protected with a single parity bit. It is virtually indexed and 
physically tagged to allow simultaneous address translation 
and data cache access. 

The Data Cache is direct mapped, and its line size can be 
configured as four or eight words. The write policy is writeback, 
which means that a Store to a cache line does not immediately 
cause memory to be updated. This increases system perfor- 
mance by reducing bus traffic and eliminating the bottleneck 
of waiting for each Store operation to finish before issuing a 
subsequent memory operation. 

Associated with the Data Cache is the store buffer. When 
the R4000 executes a Store instruction, this 2-entry buffer 
gets written with the store data while the tag comparison is 
performed. If the tag matches, then the data gets written into 
the Data Cache in the next cycle that the Data Cache is not 
accessed. The Store buffer allows the R4000 to execute two 
stores per master cycle and to perform back-to-back stores 
without penalty. Likewise, the R4000 can perform two loads or 
a load and store per master cycle without penalty, yielding 
1.2GB/sec bandwidth without restrictions on instruction com- 
binations. 








Address Data ¥ 






R4000PC 
R4400PC 


When the Data Cache line does need to be written back to 
slower memory (either secondary cache ormain memory), the 
processor writes the data to an internal write buffer which can 
hold a line (4 or 8 words) of data. By writing the data to the fast 
write buffer, the processor can continue executing instruc- 
tions without having to wait until the write completes to the 
slower memory. 

The IDT79R4000 caches are designed for easy and flexibie 
integration in many types of multiprocessor systems. The 
Data Cache contains all the necessary state bits to allow the 
R4000 to maintain cache coherency across all R4000 proces- 
sors in a system. 


SECONDARY CACHE INTERFACE 

The R4000/R4400SC and R4400MC support a secondary 
cache that can range in size from 128KBs to 4MBs. The cache 
can be configured as a unified cache or split into an instruction 
cache and a data cache, and it can be designed using industry 
standard SRAMs. The IDT R4000 provides all of the second- 
ary cache control circuitry on chip, including ECC. 

The secondary cache interface consists of a 128-bit data 
bus, a 25-bit tag bus, and 18-bit address bus, and SRAM 
control signals. The wide data bus improves performance by 
providing a high bandwidth data path to fill the primary 
caches. ECC check bits are added to both the data and tag 
buses to improve data integrity. All double-bit errors can be 
detected and all single bit-errors can be detected and all single 
bit-errors can be corrected on both buses. 

The secondary cache access time is configu rable, srouid: 
ing system designers with the flexibility to tailor the cache 
design to specific applications. The line size of the secondary 
cache is also configurable and can be 4, 8, 16, or 32 words. 
The tine size of the primary cache must always be less than or 
equal to the line size of the secondary cache. 

The secondary cache is physically tagged and physically 
indexed. The physical cache prevents problems that could 
arise due to virtual address aliasing. Also, a physical cache 
makes multiprocessing cache coherency protocols easier to 
implement. The R4400MC provides a set of cache states and 
a mechanism for manipulating the contents and state of the 
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Figure 7. Typical Desktop System Block Diagram 
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cache, which are sufficient to implement:a variety of cache 
coherency protocols, using either bus snooping or directory 
based schemes. _. : ~ 


SYSTEM INTERFACE | | 7 

The R4000 supports a 64-bit system interface that can be 
used to construct systems as simple as a uniprocessor with a 
direct DRAM interface and no secondary cache or as 
sopisticated as a fully cache coherent multiprocessor. The 
interface consists of a 64-bit Address/Data bus with 8 check 
bits and a 9-bit command bus protected with parity. In addi- 
tion, there are 8 handshake signals. The interface has a 
simple timing specification and is capable of transferring data 
between the processor and memory at a peak rate of 6(00MB/ 
sec at 75MHz. ar 

Figure 7 shows a typical desktop system using the R4000PC. 
Similarly, a high-performance desktop workstation/server 
system can be built using the IDT79R4000SC and adding a 
secondary cache. | 

The system interface allows the processor to access exter- 
nal resources in order to satisfy cache misses and uncached 
operations. The IDT79R4000MC, in addition to handling simple 
memory and |/O transactions, supports a number of cache 
coherency transactions of sufficient generality to support a 
variety of cache coherent multiprocessing models. In particu- 
lar, the interface is designed to support both bus snooping and 
' directory based multiprocessor models and supports both 
write-update and write-invalidate coherency protocols. 

Figure 8 shows a typical multiprocessor system using the 
IDT79R4000MC, an interface agent, and a secondary cache. 


System Address/Data Bus 

The 64-bit System Address Data (SysAD) bus is used to 
transfer addresses and data between the R4000 and the rest 
of the system. It is protected with an 8-bit check bus, SysADC. 
The check bits can be configured as either parity or ECC, for 
flexibility in interfacing to either parity or ECC memory sys- 
tems. oT 
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_ The system interface is configurable to allow easier inter- 
facing to memory and I/O systems of varying frequencies. The 
data rate and the bus frequency at which the R4000 transmits 
data to the system interface are programmable via boot time 
mode control bits. Also, the rate at which the processor 
receives data is fully controlled by the external device. There- 
fore, either a low cost interface requiring no write buffering or 
a fast, high performance interface can be designed to commu- 
nicate with the R4000. Again, the system designer has the 
flexibility to make these price/perfomance tradeoffs. 


System Command Bus 

The R4000 interface has a 9-bit System Command 
(SysCmd) bus. The command bus indicates whether the 
SysAD bus carries an address or data. Ifthe SysAD carries an 
address, then the SysCmd bus also indicates what type of 
transaction is to take place (for example, a read or write). If the 
SysAD carries data, then the SysCmd bus also gives informa- 
tion about the data (for example, this is the last data word 
transmitted, or the cache state of this line of data is clean 
exclusive). The SysCmd bus is bidirectional to support both 
processor requests and external requests to the R4000. 
Processor requests are initiated by the R4000 and responded 
to by an external device. External requests are issued by an 
external device and require the R4000 to respond. | 

The R4000 supports. byte, halfword, tribyte, word, 
doubleword, and block transfers on the SysAD bus. In the 
case of a sub-doubleword transfer, the low-order 3 address 
bits gives the byte address of the transfer, and the SysCmd 
bus indicates the number of bytes being transferred. 


Handshake Signals 7 

There are eight handshake signals on the system interface. 
Two of these, RdRdy and WrRdy are used by an external 
device to indicate to the IDT79R4000 whether it can accept a 
new read or write transaction. The IDT79R4000 samples 
these signals before deasserting the address on read and 
write requests. : 
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Figure 8. Multiprocessor System Using the R4400 MC 
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ExtRast and Release are used to transfer control of the 
SysAD and SysCmd buses between the processor and an 
external device. When an external device needs to control the 
interface, it asserts ExtRast. The IDT79R4000 responds by 
asserting Release to release the system interface to slave 
state. 

ValidOut and Validin are used by the IDT79R4000 and the 
external device respectively to indicate that there is a valid 
command or data on the SysAD and SysCmd buses. The 
R4000 asserts ValidOut when it is driving these buses with a 
valid command or data, and the external device drives ValidIn 
when it has control of the buses and is driving avalid command 
or data. 

Finally, there are two signals that are available on the MC 
version only and are used in multiprocessing systems. They 
are IlvdAck and lvdErr, and they are driven by an external 
device to indicate the completion status of the current proces- 
sor invalidate or update request. 





R4000 Requests 

The R4000 is capable of issuing requests to a memory and 
/O subsystem. The system interface supports two modes of 
operation: , 
* Secondary Cache mode 
* No Secondary Cache mode 


No Secondary Cache Mode : 
The R4000 without a secondary cache requires a non- 
overlapping system interface. This means that only one pro- 


TClock 
RClock 
SysAD 


SysADC 


cessor request may be outstanding at a time and that the 
request must be serviced by an external device before the 
R4000 issues another request. The R4000PC can issue read 
and write requests to an external device, and an external 
device can issue read and write requests to the R4000. 

Figure 9 shows a processor read request. The R4000 
asserts ValidOut and simultaneously drives the address and 
read commandonthSysAD and SysCmd buses. If the system 
interface has RdRdy asserted, then the processor tristates 
its drivers and releases the system interface to slave state by 
asserting Release. The external device can then begin send- 
ing the data to the IDT79R4000. 





Secondary Cache Mode 

The R4000 with a secondary cache operates in an overlap- 
ping bus transfer mode in which multiple system interface 
transactions may be issued in parallel. The processor may 
issue a combination of read request, an update or invalidate 
request, and awrite request. For instance, when a dirty cache 
line needs to be replaced, the processor issues a read request 
immediately followed by a write request, without waiting for 
the read data to return. This has the advantage of “hiding” the 
write transaction between the read request and read re- 
sponse, thus increasing overall system performance. This 
mode of operation is not necessary or useful in R4000 
systems withoutsecondary cache since the processor contains 
awrite buffer capable of accepting an entire primary cache line 
of data. Overlapping is a superset of non-overlapping 
systemoperation. 


ValidOut \ / 


Validin 


RdRdy 


WrRady 


Release 
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Figure 9. Processor Read Request 
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Figure 10 illustrates a processor request in overlap mode. 
This request is made up of a read, invalidate, and write 
request. Note that the protocol for the read, the invalidate, and 
the write are all similar to each other, with the exception that 
the processor also sends out valid data during the write 
request. In Figure 10 the processor write transaction not only 
occurs before the read response from the external device, but 
it also illustrates how an external device can hold off a write 
request through the deassertion of ey 





External Requests: 

The R4000 responds to requests issued iy an terial 
device. The requests can take several forms. An external 
device may need to supply data in response to an R4000 read 
request or it may need to'gain control over the system 
interface bus to access other resources which may be on that 
bus. It also may issue cache coherency requests to the 
processor, such as a request for the R4000 to update, invali- 
date, or snoop upon its caches, or to supply a cache line of 
data. Additionally, an external device may need to write to the 
R4000 interrupt register. 

The following isa list of the Suara avicnll requests: 
¢ Read 
¢ Write 
* Invalidate 
« Update 
¢ Snoop 
¢ Intervention. 


| Null 


TClock 
RClock 
SysAD Bus 
SysCmd Bus 


ValidOut 


Figure 11 shows an example of an external snoop request. 
The process by which the external device issues the request 
is very similar to the way the R4000 issues a request. The 
external device first gains ownership of the system interface 
by asserting. ExtRqst and waiting for the R4000 to assert 





Release. The external device then sends in a valid command 





by asserting ValidIn and driving the SysCmd and SysAD 
buses with the snoop command and address: The R4000 
responds to the request by asserting ValidOut and driving the 
sysCmd bus with the cache state of the snooped ones Mae: 


CACHE COHERENCY CAPABILITY | 

With the IDT79R4400MC, cache coherence is maintained 
in hardware. The system control coprocessor permits the 
specification of different caching protocols on a per- page 
basis. A page may be: 
uncached 
cached but non- -coherent | 
cached and coherent exclusive (only one processor 
cache contains the data on loads and stores). | 
cached and coherent exclusive on writes (write invalidate 
scheme-only one processor cache contains the data 
when that datum is written to). 
cached and coherent with updates on writes (write- 
update scheme). 
Depending upon the amount and type of data sharing i inan 
application, the operating system can choose the most appro- 
pale owe strategy. 


e ® 


e 


C2 Cod Cz) GLAD CD CED CLD Cz 
Ga) CLD CD GALLEN CLD GID GID Gal 
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RdRdy 
WrRdy 


Release 
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Figure 10.. Processor Read, invalidate, Write Request 
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Support for processor synchronization is provided by the 
Load Linked and Store Conditional instructions. The Load 
Linked and Store Conditional instructions: 

1. Provide a simple mechanism for generating all of the 
common synchronization primitives including test-and- 
set, bit-level locks, semaphores, counters, sequencers, 
etc. with no additional hardware overhead. 

. Operate in such a fashion that bus traffic is only gener- 
ated when the state of the cache line changes. 

. Need not lock a system bus—a very important feature 
for larger systems. 


ADVANCED FEATURES 


The R4000 supports a number of other capabilities in 
addition to the standard processor model described above. 
Many of these capabilities are selected by the system de- 
signer during the processor reset Sequence, via the boot time 
mode control interface. Features are included to support fault 
tolerance, system test, or other system environments. 


Boot Time Options 

Fundamental operational modes for the processor are 
initialized by the boot-time mode control interface. The boot- 
time mode control interface is a serial interface operating ata 
very low frequency (Master clock divided by 256). The low 
frequency operation allows the initialization information to be 
kept in a low cost EPROM. 

Immediately after the VCCOk Signal is asserted, the pro- 
cessor reads a serial bit stream of 256 bits to initialize all 
fundamental operational modes. After initialization is com- 


TClock 


RClock 


SysAD Bus 


SysCmd Bus 


plete, the processor continues to drive the serial clock output, 
but no further initialization bits are read. 


JTAG INTERFACE 

The JTAG boundary scan mechanism provides a capa- 
bility for testing the interconnect between the IDT79R4000 
processor, the printed circuit board to which itis attached, and 
the other components on the board. In addition the JTAG 
boundary scan mechanism provides a rudimentary capability 
for low-speed logical testing of the secondary cache RAMs. 
The JTAG boundary scan mechanism does not provide any 
capability for testing the R4000 processor itself. 

In accordance with the JTAG specification the R4000 
processor contains a TAP controller, JTAG instruction regis- 
ter, JTAG boundary scan register, JTAG identification regis- 
ter, and JTAG bypass register. However, the R4000 JTAG 
implementation provides only the external test functionality of 
the boundary scan register. 


FAULT TOLERANT SUPPORT 


The R4000 has been designed to support varying models 
of fault tolerance. These modes include: master/checker 
operation and triple-modular redundancy. !n addition to ex- 
plicit fault-tolerant modes of operation, the design of internal 
processor operation is such to support processor synchroni- 
zation; for example, both the TLB random replacement algo- 
rithm, and the on-chip timer, can be forced to known states via 
software. Thus, the IDT R4000 family can be used to build 
“non-stop” machines across a number of different system 
models. a 


ValidOut \ VA 


Validin 


ExtRast 


Release 
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Figure 11. External Snoop Request 
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BOOT-TIME MODES 


Mode Setting es Bs -_ 





BikOrder: Block read response ercenng: 
Sequential ordering. 
Sub-block ordering. 


. EIBParMode: System interface check bus checking. 


SECDED error checking and correcting mode. 


EndBit: Byte ordering. 
Little Endian. 
Big Endian. 


DShMdDis: Dirty shared mode, enables transition to dirty shared state on pioees sor: update | 
successful. 
Dirty Shared Enabled. 
_ Dirty Shared Disabled. 


NoSCMode: Specifies presence of secondary cache. 
Present. 
Not Present. 


SysPort: System Interface port width (Bit 6 Most Significant). 
64 bits. 
Reserved") 


SC64BitMd: Secondary cache interface port width. 
128 bits. 
~ Reserved") 


ElSpitMd: Secondary cache organization 
Unified 
Reserved") 
SCBIkSz: Secondary cache line size (Bit 10 Most Significant). 
4 words. 
8 words. 
16 words. 
32 words. 


XmitDatPat: System Interface Data Rate (Bit 14 Most Significant). 
D . "4 
DDx 
DDxx 

~ DxDx 
DDxxx 
DDxxxx 
DxxDxx 
DDxxxxxx 
DxxxDxxx 
Reserved") 


SysCkRatio: PClock to SClock divisor: frequency relationship between SClock, RClock, and 
TClock and PClock (Bit 17 MostSignificant). 

Divide by 2 

Divide by 3 © 

Divide by 4 

Reserved") 


TimIntDis: Timer Interrupt enable allows timer interrupts, otherwise the interrupt used by the 
timer becomes a general-purpose interrupt. 

Enabled 

Disabled 


PotUpdDis: Potential invalidate enable (allows potential invalidates to be issued. Otherwise 
only normal invalidates are issued). a ee 

Enabled 

Disabled 
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Value Mode Setting 


TWrSUp: Secondary cache write deassertion delay, TWrSup in PCycles (Bit 24 Most Signifi 
cant). e 
Undefined 

Mumber of PCLK cycles (Min 3; Max 15) 


TWr2Dly: Secondary cache write assertion delay 2, TWrDly in PCycles (Bit 26 Most Signifi 
cant). 

Undefined 

Number of PCLK cycles (Min 1; Max 3) 


TWr1Dly: Secondary cache write assertion delay 1, TWrDly in PCycles (Bit 28 Most Signifi 
cant). 

Undefined . 
Number of PCLK cycles (Min 1; Max 3) 


TWrRc: Secondary cache write recovery time, TwrRc in PCycles either 0 or 1 cycles. 
0 cycle 
1 cycle 


TDis: Secondary cache disable time, TDis in PCycles (Bit 32 Most Significant). 
Undefined 
Number of PCLK cycles (Min 2; Max 7) 


TRd2Cyc: Secondary cache read cycle time 2, TRdCyc2 in PCycles (Bit 36 Most Significant). 
Undefined 
3-15 Number of PCLK cycles (Min 3; Max 15) 


TRd2Cyc: Secondary cache read cycle time 1, TRdCyc1 in PCycles, (Bit 40 Most Significant). 
Undefined 
Number of PCLK cycles (Min 4; Max 15) 


Pkg179: R4000 type. 
Large (447 pin). SC/MC 
Small (179). PC 

CycDivisor: This mode determines the clock divisior for the reduced power mode. When the 

RP bit in the Status Register is set to one, the pipeline clock is divided by one of the following 

values (Bit 49 is Most Significant). 
Divide by 2 
Divide by 4 
Divide by 8 
Divide by 16 

Reserved") 

Drv0_50, DrvO_75, Drv1_00: Drive the outputs in N x MasterClock period (Bit 52 Most 

Significant) . 

Drive at 0.5 x MasterClockperiod. 
Drive at 0.75 x MasterClock period. 
Drive at 1.0 x MasterClock period. 


InitP: Initial values for the state bits that determine the pull-down di/dt and switching speed 
of the output buffers (Bit 53 Most Significant). 

Fastest pull-down rate. 

Intermediate pull-down rate. 

Slowest pull-down rate. 


InitN: Initial values for the state bits that determine the pull-up di/dt and switching speed 
of the output buffers (Bit 57 Most Significant). 
Slowest pull-up rate. 
Intermediate pull-up rates. 
Fastest pull-up rate. 


EnbIDPLLR: Enables the negative feedback loop that determines the di/dt and switching 
speed of the output buffers only during ColdReset. 

Disable di/dt control mechanism. 
Enable di/dt control mechanism. 
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Serial Bit 


62 | 










Mode Setting 


EnbIDPLL: Enables the negative feedback loop that determines the di/dt and switching 
speed of the output buffers during ColdReset and during normal operation. 

Disable di/dt control mechanism. 
Enable di/dt control mechanism. 


DsbIDPLL: Enables PLLs that match Masterln and produce RClock, TClock, SClock and the 
internal clocks. 

Enable PLLs. 
Disable PLLs. 


SRTristate: Controls when output-only pins are trestated 
Only whe Cold Reset is asserted. 
1 When Reset or Cold Reset are asserted 


65-255 Reserved (must be scanned in as Zeros). 


NOTES: 2884 tbl 05 
1. Selecting a Reserved value results in undefined processor behavior. 
2. O's must be presented for these reserved values. 
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PIN DESCRIPTION 


The following is a list of interface, interrupt, and mainte- 
nance pins available on the different package configurations. 


[Pinname [Type] —SSS™S~S~S erp SCSCSC~“~*~*~*~*~*S 


Secondary cache interface pins available only on the SC and MC configuration: 


SCAddr(17:1) Output Secondary cache address bus 
A 17-bit address bus for the secondary cache. 
SCAddr0(W:Z) Output Secondary cache address Isb 
To minimize loading effect, there are 4 identical copies of this signal. | 
2:0) 


SCAPar(2:0 Output Secondary cache address parity bus 
A 3-bit bus that carries the parity of the SCAddr bus and the cache control lines SCOE, SCWR, 
SCDCS and SCTCS. 


SCData(127:0) Input/Output Secondary cache data bus 
A 128-bit bus used to read or write cache data from/to the secondary cache. 


SCDChk(15:0) Input/Output Secondary cache data ECC bus 
A 16-bit bus that carries two 8-bit ECC fields that covers the 128 bits of the SCData from/to the 
secondary cache. SCDChk(15:8) corresponds to SCData(127:64) and SCDChk(7:0) corre- 
sponds to SCData(63:0). 








2) 
O 
O 
O 
” 













2) 
QO 
O 
m 


Output Secondary cache data chip select 
Chip select enable signal for the secondary cache Ram associated with SCData and SCDChk. 
Output Secondary cache output enable 
Output enable for the secondary cache RAM. | 
(24:0) Input/Output Secondary cache tag bus 
A 25-bit bus used to read or write cache tags from/to the secondary cache 
SCTChk(6:0) Input/Output Secondary cache tag ECC bus 
A 7-bit bus that carries an ECC field covering the SCTag from/to the secondary cache. 


Output Secondary cache tag chip select 
Chip select enable signal for the secondary cache tag RAM associated with SCTag and 
SCTChk. 


ep) 
© 
—- 
ped) 
Ko} 








a 
O 
~ 
9) 
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Output Secondary Cache write enable 
Write enable for the secondary cache RAM 


o) 
e) 
z 
= 
ce 





System interface pins available on all parts: 














(7:0) Input/Output | System address/data check bus 
An 8-bit bus containing check bits for the SysAD bus. 


SysCmd(8:0) Input/Output System command/data identifier bus 
A 9-bit bus for command and data identifier transmission between the processor and an 


external agent. 


Input/Output | System command/data identifier bus parity | 
| Asingle, even-parity bit for the SysCmd bus. 


Input Valid Input . 
Signals that an external agent is now driving a valid address or data on the SysAD bus and a 
valid command or data identifier on the SysCmd bus. 


ExtRast Input External request 
Signals that the system interface needs to submit an external request. 
Release Output Release interface 
| Signals that the processor is releasing the system interface to slave state | 
RdRdy Input Read Ready 
Signals that an external agent can now accept a processor read, invalidate, or update request 
in both secondary cache and no secondary cache mode or can accept a read followed by a 
| write request in secondary cache mode. 
SysAD(63:0) Input/Output System address/data bus 
A 64-bit address and data bus for communication between the processor and an external 
agent. 





s|| & 
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[FinName [Type [SSCS erptoSSCSC—“‘“~*~*Y 


Output Valid output . | 
Signals that the processor is now driving a valid address or data on the SysAD bus and a valid 





Valid Out 


command or data identifier on the SysCmd bus. 


WrRdy Input Write Ready 
Signals that an external agent can now accept a processor write request in both non-overlap 
and overlap mode. | ! 


System interface pins available only on the MC configuration. 















lvdAck Input Invalidate acknowledge : 
Signals successful completion of a processor invalidate or update request. 


IvdErr Input Invalidate error | | 
Signals unsuccessful completion of a processor invalidate or update request. | 


Interrupt pins available only on the PC configuration: . 


Int(5:1) Input Interrupt | 
Five of six general processor interrupts, bit-wise ORed with bits 5:1 of the interrupt register. 


Interrupt pin available on all devices: 


Int(0) Input 











Interrupt 





One of six general processor interrupts, bit-wise ORed with bit 0 of the interrupt register. 





Non-maskable interrupt pin available on all devices: 
NMI Input Non-maskable interrupt 
| Non-maskable interrupt, ORed with bit 6 of the interrupt register. 
Boot-time mode control interface pins available on all devices: . 
Serial boot-mode data clock output at the system clock frequency divided by two hundred and 


ModeClock Output 

; | fifty six. | | 
Modein Input Boot mode data in | - 

| , Serial boot-mode data input. 


JTAG interface pins available on all devices: . | 
JTDI JTAG datain. . | | 
| | JTAG serial data in. 
JTCK Input JTAG clock input | 
| JTAG serial clock input. 
Output JTAG data out | 
| JTAG serial data out. 

JTMS Input JTAG command | | 

JTAG command signal, signals that the incoming serial data is command data. 
Maintenance pins available on alldevices: | | oe 
lOOut Output /O output : 

Output slew rate control feedback loop output. Must be connected to |OIn through a delay loop 

that models the IO path from the R4000 to an external agent. 

Input /O input 
| Output slew rate control feedback loop input (see |OOut). . 
MasterClock Input Master clock : | 
| Master clock input at the processor operating frequency. | 

MasterOut Output Master clock out 

Master clock output aligned with MasterClock. 
| RClock(1:0) Output | Receive clocks . | ; a | 

| Two identical receive clocks at the system interface frequency. 


SyncOut Output Synchronization clock out | 
Synchronization clock output. Must be connected to Syncin through an interconnect that 
models the interconnect between MasterOut, TClock, RClock, and the external agent. 
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[PinName [Type Description 
Syncin Synchronization clock in 
Synchronization clock input. See SyncOut. | 
TClock(1 Output Transmit clocks 
Two identical transmit clocks at the system interface frequency. 
VCCOk Input VCC is OK 

When asserted, this signal indicates to the R4000 that the +5 volt power supply has been 

above 4.75 volts for more than 100 milliseconds and will remain stable. The assertion of 

VCCOk initiates the reading of the boot-time mode control serial stream. 

CoidReset Input 
and RClock begin to cycle and are synchronized with the de-assertion edge of Cold Reset. 
ColdReset must be de-asserted synchronously with MasterOut. 
Input Reset . 

This signal must be asserted for any reset sequence. It may be asserted synchronously or 
asynchronously for a cold reset, or synchronously to initiate a warm reset. Reset must be de- 
asserted synchronously with MasterOut. 

Output Fault 
Mismatch output of boundary comparators. 
Input Quiet VCC for PLL 
Quiet Vcc for the internal phase locked loop. 
Input Quiet VSS for PLL 
Quiet Vss for the internal phase locked loop. 


Maintenance pins available only on the SC and MC configurations: 


Status(7:0) Status Output 
An 8-bit bus that indicates the current operation status of the processor. 
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Cold reset 
This signal must be asserted for a power on reset or a cold reset. The clocks SClock, TClock, 
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ABSOLUTE MAXIMUM RATINGS") - RECOMMENDED OPERATION > a 


Symbol] Rating __—_—_—‘([ Commercial] Unt | TEMPERATURE AND SUPPLY VOLTAGE 
Tc Operating Temperature — | Oto+85 | °C | > | a 3 , 
(Case) oO : 


















2884 tbl 10 
| Tsias | Case Temperature Under Bias | -55 to +125 
Storage Temperature:  . . | -55to +125 | 
DC Output Current | 50m 


NOTES: _ 2884 tbl09 - 

1. Stresses: greater than those listed under ABSOLUTE MAXIMUM 
RATINGS may cause permanent damage to the device. This is a stress 
rating only and functional operation of the device at these or any other 
conditions above those indicated in the operational sections of this 
specification is not implied. Exposure to absolute maximum rating condi- 
tions for extended periods may affect reliability. : — 

2. Vinminimum = -3.0V for pulse width less than 15ns. Vin should not exceed 
Vec +0.5 Volts. 

3. Notmore than one output should be shorted ata time. Duration of the short 
should not exceed 30 seconds. 







DC ELECTRICAL CHARACTERISTICS—COMMERCIAL TEMPERATURE RANGE 
(Vcc = 5.0V + 5%; Tcase = 0°C to +85°C) ae S . | 


Symbol Parameter Conditions 


VOHC Output HIGH Voltage |lOH=—4mA 4.0 
(MasterOut, TClock, 
RClock, SyncOut)(S) 
vie Ree ed 
n 















seal ced 
oro 

, > |! 
oO 






: V 
V 


é 
on 


VIHC Input HIGH Voltage 
(MasterClock, Syncin) 

VILC Input LOW Voltage 
(MasterClock, Syncin 


Input LOW Voltage( 2) 
) 
Input Capacitance Enid 


~0.5() | 0.2Vcec | -0.5( | o8Vcec} -0.5(1) 


Oo 


Output Capacitance =e 


Input/Output Leakage 
Icc Operating Current Vcc = 5.5V, Tc=0°C | | 


NOTES: 2884 tbl 11 
1. Vit (min.) = -3.0V for pulse width less than 15ns. 

2. Except for MasterClock input. 

3. Applies to TClock, RClock, MasterOut, and ModeClock outputs. 
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AC ELECTRICAL CHARACTERISTICS—COMMERCIAL TEMPERATURE RANGE 
(Vcc=5.0V + 5%; Tcase = 0°C to +85°C) MasterClock and Clock Parameters(@) 


| sont | 67M 75Miz 
Symbol | Parameter Conditions 


| Min, 

MasterClock High Ca ie a 
[Twer.ow | MasterClockLow | | 
Ee 

| 20_| 











TMCP MasterClock Period 


Clock Jitter 
(on RClock, TClock, 
MasterOut, SyncOut) 


MasterClock Rise Time 


NOTES: 2884 tbl 12 
1. Operation of the R4000 family is only guaranteed with the phase lock loop enabled. 

2. Capacitive load for all output timings is 50pF. Deration is per CLD specification. 

3. Transition < 5ns for 50, 67MHz; transition < 3.5ns for 75MHz. 













SYSTEM INTERFACE PARAMETERS 








Max Slew Rate 
Modebits[53:56] = 0 
Modebits[57:60] = 15 


Min Slew Rate 
Modebits[53:56] = 15 
Modebits[57:60] = 0 


a 


NOTES: 2884 tbl 13a 

1. When the dynamic output slew rate control Modebits [61] or [62] are enabled, the initial values for the pull-up and pull-down rates should be set to 
the slowest value, Modebits [53:56]=15, Modebits[57 :60]=0. 

2. Timings are measured from 1.5V of the clock to 1.5V of signal. 

3. Capacitive load for all output timings is 50pF. Deration is per CLD specification. 

4. Data Output, Data Setup and Data Hold apply to all logic signals driven out of or driven into the R4000 on the system interface. Secondary cache 
signals are specified separately. 


Tpo!2:3 Data Output 




















BOOT MODE INTERFACE PARAMETERS 
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|McLk cycles | 
| MCLK cycles 


2884 tbl 13b 












5.7 21 







IDT79R4000 Family = -©- - COMMERCIAL TEMPERATURE. RANGE 








SECONDARY CACHE INTERFACE PARAMETERS — 


Symbol 


PClock to. Output Max Slew Rate 
| Modebits[53:56] = 0 
Modebits[57:60] = 15 


Min Slew Rate 
_| Modebits[53:56] = 15 
“Modebits[57:60] = 0 


TwrRe4 Cycles bet. deassertion of : | 

. SCWr to start of next cycle Pcycles © 

Twrsup* | Cycles from second oe ——— _ 
doubleword to SCWr | | Pcycles 


TRd2Cyc4 | Cycle length of 8 word Rd 
Twr2Dly* | Cycles betweenist & 2nd 


word in 8-word write 


NOTES: ne ‘a . . * 2884 tbl 14 

1. When the dynamic output slew rate control Mode bits [61] or [62] are enabled, the initial values for the pull-up and pull-down rates should be set to 
the slowest value, Modebits [53:56]=15, Modebits[57:60]=0. 

2. Timings are measured from 1.5V of the Pclock to 1.5V of signal. . 

3. Capacitive load for all output timings is 50pF. Deration is per CLD specification. 

4. Number of cycles is configured through the boot time mode control. 


CAPACITIVE LOAD DERATION 


Parameter 
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IDT79R4000 Family COMMERCIAL TEMPERATURE RANGE 





PHYSICAL SPECIFICATIONS 
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IDT79R4000 Family .—s_—_y. 


R4000 


Function 
Cold Reset 


ExtRqst 
Fault 


Reserved (NC) 


Vcc 
lOln 
lOOut 
into 
Intt 
Int2 
Int3 
Int4 
Int 
JTCK 
JTDI 
JTDO 
JTMS 


MasterClock 


MasterOut 


ModeClock 


Modelin 
NMI 
PLLCapo 
PLLCap1 
RClockO 
RClock1 
RdRdy 
Release 
Reset 
Syncin 
SyncOut 
SysADO 
SysAD1 
SysAD2 
SysAD3 
SysAD4 
SysAD5 
SysAD6 
SysAD7 
SysAD8 
SysAD9 
SysAD10 
SysAD11 
SysAD12 
SysAD13 
SysAD14 
SysAD15 
SysAD16 
SysAD17 
SysAD18 
SysAD19 
SysAD20 
SysAD21 
SysAD22 
SysAD23 
SysAD24 
SysAD25 
SysAD26 
SysAD27 
SysAD28 


PC Pkg 
Pin 


IDT79R4000/4400 PC PACKAGE PINOUT 


R4000 PC Pkg | R4000 PC Pkg 
| Function Pin Function Pin 


SysAD29 
SysAD30 
SysAD31 
SysAD32 


— SysAD33 


SysAD34 


~ SysAD35 


SysAD36 
SysAD37 
SysAD38 
SysAD39 
SysAD40 
SysAD41 
SysAD42 
SysAD43 
SysAD44 


~ SysAD45 


SysAD46 
SysAD47 


. SysAD48 


SysAD49 
SysAD50 
SysAD51 
SysAD52. 
SysAD53 
SysAD54 
SysAD55 
SysAD56 
SysAD57 
SysAD58 
SysAD59 
SysAD60 
SysAD61 
SysAD62 
SysAD63 
SysADCO 
SysADC1 
SysADC2 
SysADC3 
SysADC4 
SysADC5 
SysADC6 
SysADC7 
SysCmdo0 
SysCmd1 
SysCmd2 
SysCmd3 
SysCmd4 
SysCmd5 
SysCmd6 
SysCmd7 
SysCmd8 
SysCmdP 
TClockO 
TClockt 
VCCOk 
Validin 
Valid Out 
WrRdy 
VecP 


5.7 


COMMERCIAL TEMPERATURE RANGE 
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IDT79R4000 Family COMMERCIAL TEMPERATURE RANGE 





PHYSICAL SPECIFICATIONS 
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9 e e e e e e ® e e e e e e e e §8606©68 e e e 


R4000, R4400 MC/SC 
Pinout 
Bottom View 
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IDT79R4000 Family - 


R4000 


Function 


Cold Reset 
ExtRast 
Fault 


lOin 
lOOut 
Into 
lvdAck") 
lvd Errtts 
JTCK 
JTDI 
JTDO 
JTMS 


MasterClock 


MasterOut 
ModeClock 


Reserved (NC) 


SCAPar0 
SCAPar1 
SCAPar2 
SCAdadr1 
SCAddr2 
SCAddr3 
SCAddr4 
SCAddr5 
SCAddré6 
SCAddr7 
SCAdadr8 
SCAddr9 
SCAddr10 
SCAdadr11 
SCAddr12 
SCAddr13 
SCAddr14 
SCAddr15 
SCAddr1i6 
SCAddr17 
SCAddr0W 
SCAddr0X 
SCAddr0Y 
SCAddr0Z 
SCDCS 
SCDChkO 
SCDChk1 
SCDChk2 
SCDChk3 
SCDChk4 
SCDChk5 
SCDChk6 
SCDChk7 
SCDChk8& 


Reserved (NC) 


SC/MC Pkg 
Pin 





IDT79R4000/R4400 MC/SC PACKAGE PINOUT 


R4000 
Function 


~ SCDChk9 
SCDChk10 


SCDChk11 


‘SCDChk12 ~ 
SCDChk13 _ 


SCDChk14 
SCDChk15 
SCData0d 
SCData1 
SCData2 
SCData3 
SCData4 
SCData5 
SCDataé 
SCData7 
SCData8 
SCData9 
SCData10 
SCData11 
SCDatai2 
SCData13 
SCData14 
SCData15 
SCData16 
SCData17 
SCData18 
SCData19 
SCData20. 
SCData21 
SCData22 
SCData23 
SCData24 
SCData25 
SCData26 
SCData27 
SCData28 
SCData29 
SCData30 
SCData31 
SCData32 
SCData33 
SCData34 
SCData35 
SCData36 
SCData37 
SCData38 
SCData39 
SCData40 
SCData41 
SCData42 
SCData43 
SCData44 
SCData45 
SCData46 
SCData47 
SCData48 
SCData49 
SCData50 
SCData51 
SCData52 


5.7 


SC/MC Pkg 
- Pin 


COMMERCIAL TEMPERATURE RANGE 


SCData53 © 
SCData54 
SCData55 
SCData56 
SCData57 
SCData58 
SCData59 
SCData60. 
SCData61 
SCData62 
SCData63 
SCData64 
SCData65 
SCData66 
SCData67 
SCData68 
SCData69 
SCData70 
SCData71 
SCData72 
SCData73 
SCData74 
SCData75 
SCData76 
SCData77_ 
SCData78 
SCData79 
SCData80 
SCData81 
SCData8&2 
SCData8&3 
SCData84 
SCData85 
SCData86 
SCData87 
SCData88 
SCData89 
SCData90 
SCData91 
SCData92 
SCData93 
SCData94 
SCData95 
SCData96 
SCData97 
SCData98 
SCData99 
SCData100 
SCData101 
SCData102 
SCData103 
SCData104 
SCData105 
SCData106 
SCData107 
SCData108 
SCDatai09 
SCData110 
SCData111 
SCDatai112 








~ R4000 SC/MC Pkg | 
Function — Pin 
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IDT79R4000/R4400 MC/SC PACKAGE PINOUT (continued) 


R4000 SC/MC Pkg R4000 R4000 SC/MC Pkg 


Function 


SCData113 
SCData114 
SCData115 
SCData116 
SCData117 
SCData118 
SCData119 
SCData120 
SCData121 
SCData122 
SCData123 
SCData124 
SCData125 
SCData126 
SCData127 


Function 


Status7 
Syncin 
SyncOut 
SysADO 
SysAD1 
SysAD2 
SysAD3 
SysAD4 
SysAD5 
SysAD6 
SysAD7 
SysAD8 
SysAD9 
SysAD10 
SysAD11 
SysAd12 
SysAD13 
SysAD14 
SysAD15 
SysAD16 
SysAD17 
SysAD18 
SysAD19 
SysAD20 
SysAD21 
SysAD22 
SysAD23 
SysAD24 
SysAD25 
SysAD26 
SysAD27 
SysAD28 
SysAD29 
SysAD30 
SysAD31 
SysAD32 
SysAD33 
SysAD34 
SysAD35 
SysAD36 
SysAD37 
SysAD38 
SysAd39 
SysAD40 
SysAD41 
SysAd42 
SysAD43 
SysAd44 
SysAD45 
SysAD46 
SysAD47 
SysAD48 
SysAD49 
SysAD50 
SysAD51 
SysAD52 
SysAD53 
SysAD54 
SysAD55 
SysAD56 


Function 


SysAD57 
SysAD58 
SysAd59 
SysAD60 
SysAD61 
SysAD62 
SysAD63 
SysADCO 
SysADC1 
SysADC2 
SysADC3 
SysADC4 
SysADC5 
SysADC6 
SysADC7 
SysCmd0 
SysCmd1 
SysCmd2 
SysCmd3 
SysCmd4 
SysCmd5 
SysCmd6 
SysCmd7 
SysCmd8 
SysCmdP 
TClockO 
TClock1 
VCCOk 
Validin 
Valid Out 
WrRdy 
VccSense 
VssSense 
VcecP 
VssP 

Vcc 

Vcc 

Vcc 

Voc 

Vcc 

Vcc 

Vcc 

Vec 

Vcc 

Vcc 

Vec 

Vcc 

Vcc 

Vec 

Vcc 

Vcc 

Vec 

Vcc 

Vec 


Pin 
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|DT79R4000/R4400 MC/SC PACKAGE PINOUT (continued) 


~R4000- SC/MC Pkg | -R4000° SC/MC Pkg R4000 SC/MC Pkg 
Function Pin | Function Pin — Function ~ Pin» 


Vec | — en Whee oe , | | Y4. 

Veo | V6 s ee 
Veo ‘. * 5 . AB6 
Veo - | | 

Vec oe : | | 

Vcc | B34. | oo 7 AF2. | 
Veco — ey | | , | 
Veco | 7 : 
Veco | | : : 

Vcc | . : | a | 
Vcc | 
Vcc 

Vcc 

Vcc 

Vcc - 

Vcc 

Vcc 

Vcc 

Vec 

Vcc 

Vcc 

Vec 

Vec 

Vec 

Vcc 

Vec 

Vec 

Vec 

Vcc 








NOTE: ng 2884 tbl 19 


1. Available in IDT79R4400MC only. For IDT79R4000SC and R4400SC, these inputs must be pulled to Vcc. 
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IDT79R4000 Family COMMERCIAL TEMPERATURE RANGE 








ORDERING INFORMATION 
IDT XXXX YY 999 A A 
Device Configuration Speed Package Process/ 
Type Temperature 


Range 


Commercial (0°C to +85°C (Case)) 


PGA 179 for PC 
PGA 447 for SC, MC 


50MHz 
67MHz 
75MHz 


Primary Cache Only 
Secondary Cache 
Cache Coherent 





R4000 8KB Each of I-, D-Cache 
R4400 16KB Each of I-, D-Cache 
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VALID COMBINATIONS 


R4000 PC — 50, 67 
R4000 SC — 50, 67 
R4400 PC — 50, 67, 75 
R4400 SC — 50, 67, 75 
R4400 MC — 50, 67, 75 


@ @ GG) G 
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RISC SUPPORT COMPONENTS 


A RISC microprocessor is an important, but not self- 
sufficient, element of a high-performance general or embed- 
ded computing system. Equally important is the memory 
system (both cache and main memory) and the |/O interface 
to the execution core. 

To simplify the task of building these high-performance 
subsystems, IDT produces a wide variety of Support chips and 
building block devices. These chips range from general pur- 
pose devices such as fast static RAM and high-performance 
logic (used with many processor families), to specialized 
devices used in only certain types of applications (such as the 
IDT LaserFIFO, used in laser printer systems) and devices 
designed to work with only a specific processor family. 

Generic building block devices include SRAMs, with den- 
sities from 16KB to 1MB and access times as low as 7ns, as 
well as high-speed logic devices such as the FCT-T family. 

Devices specifically developed for RISC systems include 
the 3720 Bus Exchanger and 3721 DRAM Controller. These 
components facilitate design of systems based upon the 
R3051/52 controller family. The DRAM and |/O controllers 
have direct bus interface to the 3051/52. 


6.0 


The R3020 Write Buffer enhances the performance of 
R3000 systems by allowing the processor to perform write 
operations at full clock speeds instead of resorting to time- 
consuming CPU stall cycles. The memory can then retire the 
data at a slower rate. The R32xx family of read/write buffers 
includes the memory read capability, enabling the use of 
slower main memory without impacting system performance. 

The R3730 Integrated System Controller is a highly pro- 
grammable controller for high-performance Raster Image 
Systems. It contains DMA, FIFOs and arbitration logic to 
provide bandwidth matching between CPU/memory bus and 
slower standard |/O peripherals. 

By providing these system solutions as building blocks, IDT 
allows its customers the maximum flexibility in achieving their 
price/performance goals while minimizing time-to-market, real 
estate and complexity of the end system. 

This section of the data book contains some selected 
devices which have either been specifically designed for 
particular RISC processors or found to be exceptionally useful 
in these high-performance systems. 
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RISC FLOATING POINT 
ACCELERATOR (FPA) 


Integrated Device Technology, Inc. 


IDT79R3010A 
IDT79R3010AE 





FEATURES: 
* Hardware Support of Single and Double-Precision 
Operations: 
— Floating-Point Add 
— Floating-Point Subtract 
— Floating-Point Multiply 
— Floating-Point Divide 
— Floating-Point Comparisons 
— Floating-Point Conversions 
* Sustained performance: 
— 11 MFLOPS single precision LINPACK 
— 7.3 MFLOPS double precision LINPACK 
¢ 16.7MHz through 40MHz operation 
¢ Direct, high-speed interface with IDT79R3000A and 
IDT79R3001 Processor 
¢ Supports Full Conformance With IEEE 754-1985 Floating- 
PointSpecification 
Full 64-bit operation using sixteen 64-bit data registers 
High-speed CMOS technology 
32-bit status/control register providing access to all IEEE- 
Standard exception handling 


¢ Load/store architecture allows data movement directly 
between FPA and memory or between CPU and FPA 
* Overlapped operation of independent floating point ALUs 


DESCRIPTION: 

The IDT79R3010A Floating-Point Accelerator (FPA) op- 
erates in conjunction with the IDT79R3000A Processor and 
extends the IDT79R3000As instruction set to perform arith- 
metic operations on values in floating-point representations. 
The IDT79R3010A FPA, with associated system software, 
fully conforms to the requirements of ANSI/IEEE Standard 
754-1985, “IEEE Standard for Binary Floating-Point Arith- 
metic.” In addition, the architecture fully supports the standard’s 
recomendations. 

This data sheet provides an overview of the features and 
architecture of the 79R3010A FPA. A more detailed descrip- 
tion of the operation of the device is incorporated in the 
FRR3000A Family Hardware User’s Manual, available from IDT, 
and a more detailed architectural overview is provided in the 
MIPS RISC Architecture book, available from MIPS/SGI. 
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Figure 1. IDT79R3010A Functional Block Diagram 


The IDT logo ts a registered trademark of Integrated Device Technology, tnc. 
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IDT79R3010A FPA REGISTERS _ 
The IDT79R3010A FPA provides 32 general purpose 32- 
bit registers, a Control/Status register, and a Revision Identi- 


General Purpose Registers 
(FGR/FPR) - 


. 63 32 31 0 





COMMERCIAL TEMPERATURE RANGE 


fication register. The tightly-coupled coprocessor interface 
causes the register resources of the FPA to appear to the 
systems programmers as an extension of the CPU internal 
registers. The FPA registers are shown in Figure 2. . 


Control/Status Register 
31 


| Exceptions/Enables/Modes 


Implementation/Revision 
31 egister 0 


oo) 
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Figure 2. IDT79R3010A FPA Registers 


Floating-point coprocessor operations reference three types 
of registers: | | 
« Floating-Point Control Registers (FCR) 
* Floating-Point General Registers (FGR) 
- Floating-Point Registers (FPR) 


Floating-Point General Registers (FGR) 

There are 32 Floating-Point General Registers (FGR) on 
the FPA. Theyrepresentdirectly-addressable 32-bit registers, 
and can be accessed by Load, Store, or Move Operations. 


Floating-Point Registers (FPR) 

The 32 FGRs described in the preceding paragraph are 
- also used to form sixteen 64-bit Floating-Point Registers 
_ (FPR). Pairs of general registers (FGRs), for example FGRO 
and FGR1 (refer to Figure 2) are physically combined to form 
a single 64-bit FPR. The FPRs hold a value in either single- 
or double-precision floating-point format. Double-precision 
format FPRs are formed from two adjacent FGRs. 


Floating-Point Control Registers (FCR) 

There are 2 Floating-Point Control Registers (FCR) onthe 
FPA. They can be accessed only by Move operations and 
include the following: 
¢ Control/Status register, used to control and monitor 

exceptions, operating modes, and rounding modes; 

* Revision register, containing revision information about 
the FPA. 


COPROCESSOR OPERATION 

The FPA continually monitors the IDT79R3000A processor 
instruction stream. If an instruction does not apply to the 
coprocessor, It is ignored; if an instruction does apply to the 
coprocessor, the FPA executes that instruction and transfers 
necessary result and exception data synchronously to the 
IDT79R3000A main processor. 


The FPA performs three types of operations: 
¢ Loads and Stores; 
* Moves; 
¢ Two- and three-register floating-point operations. 


Load, Store, and Move Operations 

Load, Store, and Move operations move data between 
memory or the IDT79R3000A Processor registers and the 
IDT79R3010A FPA registers. These operations perform no 
format conversions and cause no floating-point exceptions. 
Load, Store, and Move operations reference a single 32-bit 
word of either the Floating-Point General Registers (FGR) or 
the Floating-Point Control Registers (FCR). 


Floating-Point Operations 
The FPA supports the folowing single- and double-preci- 
sion format floating-point operations: 
* Add 
* Subtract 
¢ Multiply 
¢ Divide 
¢ Absolute Value 
¢« Move 
¢« Negate 
* Compare 
In addition, the FPA supports conversions between single- 
and double-precision floating-point formats and fixed-point 
formats. 
The FPA incorporates separate Add/Subtract, Multiply, 


_ and Divide units, each capable of independent and concurrent 


operation. Thus, to achieve very high performance, floating 
point divides can be overlapped with floating point multiplies 
and floating point additions. These floating point operations 
occur independently of the actions of the CPU, allowing 
further overlap of integer and floating point operations. Figure 
3 illustrates an example of the types of overlap permissible. 





IDT79R3010A/AE 
RISC FLOATING POINT ACCELERATOR 


Only Load, Store, and Move operations 
are permitted in FPA during these cycles. 


Other FPA instructions can proceed during 
these cycles. However, two multiply or two 
divide operations cannot be overlapped. 





These cycles are free for integer operations 
in the CPU. 


fo ad 
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2873 drw 03 


Figure 3. Examples of Overlapping Floating Point Operation 


Exceptions 

The IDT79R3010A FPA supports all five IEEE standard 

exceptions: 

Invalid Operation 

Inexact Operation 

Division by Zero 

Overflow 

Underflow 

The FPA also supports the optional, Unimplemented Op- 
eration exception that allows unimplemented instructions to 
trap to software emulation routines. 

The FPA provides precise exception capability to the CPU; 
that is, the execution of a floating point operation which 
generates an exception causes that exception to occur at the 
CPU instruction which caused the operation. This precise 
exception capability is a requirement in applications and 
languages which provide a mechanism for local software 
exception handlers within software modules. 


e 


INSTRUCTION SET OVERVIEW 

All |DT79R3010A instructions are 32 bits long and they can 
be divided into the folowing groups: 
Load/Store and Move instructions move data between 
memory, the main processor and the FPA general 
registers. 
Computational instructions perform arithmetic operations 
on floating point values in the FPA registers. 
Conversion instructions perform conversion operations 
between the various data formats. 
Compare instructions perform comparisons of the con- 
tents of registers and set a condition bit based on the 
results. The result of the compare operationis output onthe 
FpCond output of the FPA, which is typically used as 
CpCond1 on the CPU for use in coprocessor branch 
operations. 
Table 1 lists the instruction set of the IDT79R3010A FPA. 


OP | Description [| OP | Description 


Load/Store/Move Instructions 
Load Word to FPA 

Store Word from FPA 

Move Word to FPA 

Move Word from FPA 

Move Control word to FPA 
Move Control word from FPA 


LWC1 
SWC1 
MTC1 
MFC1 
CTC1 
CFC1 


Conversion Instructions 
CVT.S.fmt 
CVT.D.fmt 
CVT.W.fmt 


| Floating-point Convert to Single FP 
Floating-point Convert to Double FP 
Floating-point Convert to fixed-point 





Computational Instructions 
Floating-point Add 
Floating-point Subtract 
Floating-point Multiply 
Floating-point Divide 
Floating-point Absolute value 
Floating-point Move 
Floating-point Negate 
Compare Instructions 
Floating-point Compare 


ADD.fmt 
SUB.fmt 
MUL. fmt 
DIV.fmt 

ABS.fmt 
MOV.fmt 
NEG.fmt 


C.cond.fmt 
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Table 1. IDT79R3010A Instruction Summary 
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ID79R3010 PIPELINE ARCHITECTURE 
The IDT79R3010A FPA provides an instruction pipeline 
that parallels that of the IDT79R3000A processor. The FPA, 


however, has a 6-stage pipeline instead of the 5-stage pipeline | 
of the IDT79R3000: the additional FPA pipe stage is used to 


provide efficient coordination of exception responses between 

the FPA and main processor. | | 
The execution of a single IDT79R3010A instruction con- 

sists of six primary steps: 

1) IF—Instruction Fetch. The main processor calculates the 
insruction address required to read an instruction from the 
l-Cache. No action is required of the FPA during this pipe 
stage since the main processor is responsible for address 
generation. | 

2) RD—The instruction is present on the data bus during 
phase 1 of this pipe stage and the FPA decodes the 
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instruction on the bus to determine if it is an instruction for 
the FPA. 

3) ALU—lIf the instruction is an FPA instruction, instruction 
execution commences during this pipe stage. 

4) MEM—If this is a coprocessor load or store instruction, the 
FPA presents or captures the data during phase 2 of this 
pipe stage. 

5) WB—The FPA uses this pipe stage solely to deal with 
exceptions. 

6) FWB—The FPA uses this stage to write back ALU results 
to its register file. This stage is the equivalent of the WB 
stage in the IDT79R3000A main processor. 

Each of these steps requires approximately one FPA cycle 
as shown in Figure 3 (parts of some operations spill over into 
another cycle while other operations require only 1/2 cycle). 


INSTRUCTION EXECUTION 


Register file 


write back or FP 
exceptions 





One Cycle 
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Figure 4. IDT79R3010A Instruction Summary 





Instruction 
Flow 





Current 
Cycle 
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Figure 5. IDT79R3010A Instruction Pipeline 


The IDT79R3010A uses a 6-stage pipeline to achieve an 
instruction execution rate approaching one instruction per 
FPA cycle. Thus, execution of six instructions at a time are 


overlapped as shown in Figure 5. 


This pipeline operates efficiently because different FPA 


‘resources (address and data bus accesses, ALU operations. 
_ register accesses, and so on) are utilized on a non-interfering 


basis. 
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PACKAGE THERMAL SPECIFICATIONS 


The IDT79R3010A utilizes special packaging techniques 
to improve both the thermal and electrical characteristics of 
the floating point accelerator. 

In order to improve the electrical characteristics of the 
device, the package is constructed using multiple signal 
planes, including individual power planes and ground planes 
to reduce noise associated with high-frequency TTL parts. 

In order to improve the thermal characteristics of the 
floating point accelerator, the device is housed using cavity 
down packaging for the flatpack and the PGA (the J-bend 
CerQuad is cavity up). In addition, these packages incorpo- 
rate a copper-tungsten thermal slug designed to efficiently 
transfer heat from the die to the case of the package, and thus 
effectively lower the thermal resistance of the package. The 
use of an additional external heat sink affixed to the package 
thermal slug further decreases the effective thermal resis- 
tance of the package. 

The case temperature may be measured in any environment 
to determine whether the device is within the specified oper- 
ating range. The case temperature should be measured atthe 
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COMMERCIAL TEMPERATURE RANGE 


center of the top surface opposite the package cavity (the 
package cavity is the side where the package lid is mounted). 

The equivalent allowable ambient temperature, TA, can be 
calculated using the thermal resistance from case to ambient 
(ca) for the given package. The following equation relates 
ambient and case temperature: 

TA= 1c -P*@ca 

where P is the maximum power consumption, calculated by 
using the maximum Icc from the DC Electrical Characteristic 
section. 

Typical values for @ca at various airflows are shown in 
Table 2 for the various CPU packages. 








@ca (84-PGA) 
Oca (84-Flatpack) 
Oca (84-CerQuad) 
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Table 2. Thermal Resistance (Oca) at Various Airflows 
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PIN CONFIGURATION) 
(Top View) | , 
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|__| Data (30) 
ae Data (29) 
| |: Data (23) 
|__| Data (22) 
-—] Data (21) 
|__| Data (20) 
7 veci4 
[—_] Data (18) - 
-__] Data (17) | 
[J Data (16) 


o.{ | ‘Data (27) 
“FF 7] veco © 
o [7] GNDO 

aft] Data (26) 
ae Data (25) 
© [] Data (24) 


oO 
a 
ow 
— 
wo 
-O 
9 


~™% [77] DataP (2) 





GND13 


Clk2xRd [___]. 
FpSysin [1 -——]_ DataP (1) 
Data (31) DC [—} vec12 
VCC1 {[- 7] GND12 
GND1 [<=]. FpCond 
- DataP (3) (OO -——] FpBusy 
Clk2xSys -——] Exception 
Clk2xSmp [___| [77] Run 
Cik2xPhi [___] T] Resvd2 
Reset Con 84-Pin J Bend FJ = resvas 
FpSyne [ — ver 
vecz [__ [__] GND11 
GND2 L__ [_-] VCC10 
vec3 [__ -——] GND10 
GND3 [7] -——] FpPresent 
PLLOn [|] 28 | Resvdo 
vec4 LC [——] vocg 
GND4 [___] [——} GND9 
vcecs [__] [——~] vcocs 
GND5 C__] [__] GND8 
Sewer sg ftEHPoeoteeEweoocr Agnvcgcsesys ws 
os 8 oe © ZO we 8B we we He Be we rMHe KH ZOTLSrelOTrelTerelh fT 
BS IS iS a te 8 iS ® s . s & § 5 o ie s s g g 2873 drw 06 
EY QO aa a 
NOTE: 


1. Reserved pins must not be connected. 
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PIN CONFIGURATION") 
(Ceramic, Cavity Down) - BOTTOM VIEW 






M Data DataP FP 
Data Data Data Excep- 
Data 
; 
Data 
DataP 
: 
Data 
G 25 : es ‘ 
84-Pin Ceramic Pin Grid Array 
i 
Data 
. 
Data 
: 
: 
Data 
° 
A Vcc 
1 2 3 4 5 6 i 8 9 10 11 12 
2873 drw 07 
NOTE: 


1. Reserved pins must not be connected. 
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PIN CONFIGURATION 
84—L QUAD FLATPACK (CAVITY DOWN) 
TOP VIEW 


84 
1 

Data (30) ("JJ 
Data (29) (7 
Data (28) ("J 
Data (27) ("> 
VCC (7 
GND (J 
Data (26) ("J 
Data (25) ("J 
Data (24) ("J 
DataP (2) (J 
Data (23) ("J 
Data (22) [J] 
Data (21) ("7 
Data (20) [7] 
VCC [7] 
GN 
Data (19) ("J 
Data (18) [~~] 
Data (17) ("7 
Data (16) 
VOC 


21 
22 


NOTE: 
Reserved pins must not be connected. 


1. 


[_] Clk2xRd 


C] Fpsysin 


[ ———] Data (31) 


> 


GND Ld 


DataP (1) 


vec LL 
GND [Lo 
FpCond (___ 
FpBusy (__ 


-——__] veo 


Pa 1 GND 


________] DataP (3) - 
pd} Fp SysOut © 


ae Clk2xSys 


[7 Clk2xSmp 


ees Clk2xPhi 








[~~] Reset 
P| «Fp Syne 
fo | VCC 
P| GND 
Ee | MOG 
TJ GND 


HEATSINK 


Folnt Ld 
Exception [__ 





Run Ld 
Resvd2 [_ 
Resvd1 [ 





VCC Co 
GND Lt 
VCC: 
GND (_d 
FpPresent L__. 
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TC d)CGND 


Oo) 


Lee et PLLOn 
pe VOCE 


o 
BA 


VCC 
Data (4) 
Data (5) 
Data (6) 
Data (7) 
DataP (0) 
Data (8) 
Data (9) 
Data (10) 
Data (11) 
GND 
VCC 
Data (12) 
Data (13) 
Data (14) 
Data (15) 
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PIN DESCRIPTIONS 


PPinnome [vO] Cin 
Data (0-31 /O | A multiplexed 32-bit bus used for instruction and data transfers on phase 1 and phase 2, respectively. 


-31) 
DataP (0-3) A 4-bit bus containing even parity over the data bus. Parity is generated by the FPA on stores 


Input to the FPA which indicates whether the processor-coprocessor system is in the run or stall state 
| 














Signal to the CPU indicating a request for a coprocessor busy stall 


Signal to the CPU indicating the result of the last comparision operation 
Signal to the CPU indicating that a floating-point exception has occured for the current FPA instruction 


Synchronous initialization input used to distinguish the processor-FPA synchronization period from the 
execution period. Reset must be synchronized by the leading edge of SysOut from the CPU. 


| Exception | Input to the FPA which indicates exception related status information. 
FoBusy indicati ; | 






| FoCond 
| Foint | 


















: 


Input which during the reset period determines whether the phase lock mechanism is enabled and during the 
execution period determines the output timing model. 
















Output which is pulled to ground through an impedance of approximately 0.5kQ. By providing an external 
pullup on this line, an indication of the presence or absence of the FPA can be obtained. 


( 2 
n 
Clik2xSys 
Clk2xSmp 
Clk2xRd 
Clk2xPhi 
| 
Cc 


| 


| 
psysOut 








Input used to receive the synchronization clock from the FPA 





t 
X 
X 
x 
X 
y 
y 





Input used to receive the synchronization clock from the CPU 
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ABSOLUTE MAXIMUM RATINGS": *) RECOMMENDED OPERATING 


a a a ei ZAND SUPPLY VOLTAGE 
Terminal Voltage | —0.5to +7.0 V ees 


with Respect et 0°C to +70°C_ 5.0 5% 
16-33MHz _ (Ambient) 
































to GND 










0 to +70) 









2873 tbl. 06 


TA, TC Operating 
Temperature (Ambient) Commercial 0°C to +90°C 5.0 +5% 
7 0 to +908) 40MkHz ~ _ (Case) | 


(Case) 
Under Bias 0 to +90°) 
| tenets | RP | 
Temperature 
ead 


Input Voltage —0.5 to +7.0 


ae TE: 2873 tbl 04 

1. Stresses greater than those listed under ABSOLUTE MAXIMUM RAT- 
INGS may cause permanent damage to the device. This is a stress rating 
only and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 
implied. Exposure to absolute maximum rating conditions for extended 
periods may affect reliability. 

2. VIN minimum = —3.0V for pulse width less than 15ns. 






OUTPUT LOADING FOR AC TESTING 





VIN should not exceed VCC +0.5 Volts. VREF Fo Device 
3. Notmore than one outputshould be shorted ata time. Duration of the short indar tect 
should not exceed 30 seconds. +1.5V 
4. 16-33MHz only. 
5. 40MHz only. | | | | 25pF 
, 2873 drw 09 
AC TEST CONDITIONS 


Fin [araiononae [8 
cece 


Input LOW Voltage 
Input HIGH Voltage 
Input LOW Voltage 
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DC ELECTRICAL CHARACTERISTICS FOR IDT79R3010A 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to + 70°C, Vcc = + 5.0 V+ 5%) 


16.67 MHz 
Symbol Parameter Test Conditions 









| VoH | OutputHIGH Voltage | Vcc=Min,loH=-4mA | 35 | 
| Vo. | Output LOW Voltage Vec=Min,lo.=4mA | — 
Output LOW Voltage) | Vcc=Min,lo=1.5mA [| — | 

, | 3.0 


VOH 
VOL P 
0 
Viv [_InputHIGH Voltage | 
pCO Votaget—[ 
vrs [pat HGH Votage@e7 [J 
| Vis | InputLOW Voltage? fo 
inputHIGH Vottage@) [sd 
| Vic | InputLow Voltage) | 
| cies 
Icc 
IH 
HL 
loz 


cn | Input Capacitancel eee 


0 













6) 

4) 

[input HIGH Leakage [Vn=veo | 
(3) 


3.5 

2.0 

3.0 

4.0 | 
| 525 


| Ii | Input LOW Leakage ViL = GND 
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DC ELECTRICAL CHARACTERISTICS FOR IDT79R3010AE 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to + 70°C, Vcc = + 5.0 V + 5%) 


a re Test Conditions | Min. 
|_Voure | Output LOW Voltage) | Voc= Min. lo.=1.5mA | — 









Input LOW Voltage() 


Input LOW Voltage(.2) 


) 
ViHS | Input HIGH Voltage(26) 
) 







cw [input Capaciance™ 
cout | Ourput Capacitance 
Cn 

ie 


) 
input HIGH Leakage | Va=Voo 
-toz__| Output Ti-state Leakage | Von=2av,vo-o5v | 100 | 100 | too | 100 | 
2873 tbl 08 
NOTES: 
Vit Min. = -3.0V for pulse width less than 15ns. Vit should not fall below —0.5V for larger periods. 
. VIHs and ViLs apply to Clk2xSys, Cik2xSmp, Clk2xRd, Clk2xPhi, FpSysin, FpSync and Reset. 
These parameters do not apply to the clock inputs. 
VIHC and Vitc apply to Run, PllOn and Exception. 
VOLFP applies to the FPPresent pin only. 
ViH and ViHs should not be held above Vcc + 0.5V. 
Guaranteed by design. 







VOH 
VOL 

VIH 

Vit 

CIN 

loc | 
ltH 

HL | 
lOZ 














Oy OV Oe 
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DC ELECTRICAL CHARACTERISTICS FOR IDT79R3010AE — 
COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to + 90°C, Vcc = + 5.0 V+ 5%) 












on 


| Von | Output HIGH Voltage Vcc =Min, lo¥=—4mA | 3 
| Vo. | Output LOW Voltage Vcc = Min, lol = 4mA 


0.4 
0.5 


V 

| Vv 

votre 

| Vv 
Cc 






oO 


) 
Input LOW Voltage(?) 
Input HIGH Voltage(2.6) 
Input LOW Voltage(1.2) 

3) 


} 


0 






2 
3 
4 





0.4 


Output Capacitance”) PO 


OH 
OL 
1H 
IL 
Input HIGH Voltage(4) 
Input LOW Voltage(1.4) 
IN 

Icc Operating Current | Vcc = 5.0V, Tc = 90°C | 
HH 
loz 


1 , 
| Cin | Input Capacitance(7) 
( 


750 
100 


ss hk 
oO oO . ~~ ° . 


Tit | InputHIGH Leakage | Vn=Voo__—d 
| lw | Input LOW Leakage) =] Vit = GND -100 
| loz | Output Tri-state Leakage | VoH=2.4V, VoL=0.5V | 100 100 


NOTES: 

Vit Min. = —3.0V for pulse width less than 15ns. Vit should not fall below —0.5V for larger periods. 
Vins and ViLs apply to Clk2xSys, Clk2xSmp, Clk2xRd, Cik2xPhi, FpSysin, FpSync and Reset. 
These parameters do not apply to the clock inputs. 

VIHC and Vitc apply to Run, PlilOn and Exception. 

VOLFP applies to the FPPresent pin only. 

ViH and ViHs should not be held above Vcc + 0.5V. 

Guaranteed by design. 
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AC ELECTRICAL CHARACTERISTICS FOR IDT79R3010A": 3) 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V + 5%) 


igh | Input Clock HIGH(@) 
ToKLow Input Clock LOW(@) 






a7,) 


ymbo 
Clock 





ns 





a) 
” 






Got — Ff — 
pyopo 





Input Clock Period ns 
Clk2xSys to Clk2xSmp(°) ns 
Clk2xSmp to Clk2xRd(°) ns 





4] aya 

2) @) 

x x 

Uv ak 

& 

ZIiZz 

o1o 

we | st 

© | @ 

NIN 
© 
° 


™ 
ro) 


2xSmp to Clk2xPhi(5) 
Timing Paramters 


Tova (oade S5pF 
DH 

io in 
Tron | 
SExS 


Stall Set-up 
Stall Hold 


Reset Initialization 


ITrstPLL_| Reset Timing, Phase-lock on(*: ©) 3000 |  — | 300 | — | Teyc 
[ResetTiming, Phase-lockoff) [| S28 =| SO | Ste | Toye | 


Capacitive Load Deration 


[CLD__| Load Derate(® re ee ee ee ee ae ee ee 


NOTES: 2873 tbl 12 
1. All timings are referenced to 1.5V. 

. The clock parameters apply to all four 2xClocks: Clk2xSys, Cik2xSmp, Clk2xRd, and Clk2xPhi. 

This parameter is guaranteed by design. 

With PllOn asserted, Reset must be asserted for the longer of 3000 clock cycles or 200 microseconds. 

Tcye is one CPU clock cycle (two cycles of a 2x clock). 

No two signals on a given device will derate for a given load by a difference greater than 15%. 

. Clock transition time < 5ns. 
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AC ELECTRICAL CHARACTERISTICS FOR IDT79R3010AE(}3) 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Voc = +5.0V 45%). 













: Test Conditions | 


Input Clock Period 
Clk2xSys to Clk2XSmp(°) 
Clk2xSmp to Clk2xRd(5) 
Clk2xSmp to Clk2xPhi(9) 


iming Paramters | 


DEn | Data Enable) = 


Data Disable 


miro 


DVal a: 
|TRSDs | Reset Set-up | 
DS Data Set-up - 





=) 
w” 


Data Valid 


3 
” 


a]/ayala 
oO 
S) 
n 


4 
=) 
n 


ToH. 
pCond| Fp Condition z 
feusy|FpBusy 

Fpint 


F 


=) 
w 





=) 
1°72) 


3 
” 


pMov |Fp Move To | _ | 
| Exception Set-up (Run Cycle) 
Exception Set-up (Stall Cycle) 


Exception Hold eee 


Run Set-up 


4/Al/Aa}a}/ayja}ala 
mim~o]y] aim nm 
Cc |x -m{im 
3S |2iix | x 
dp) NLM 


~>iJ 
nrn 


=} 
n 


3 {5 
myn 


[TRund_ | Run Hold ae, 
Stall Set-up - oe 
Stall Hold 


Reset Initialization — 


=] 
~” 


+)o 
oO |D 
S15 
wm jo 


=) 
w 


| TStallH 





Capacitive Load Deration 


foxp [Load Dero) os to 8.0 ns2 pF 


NOTES: 2873 tbl 13 
1. All timings are referenced to 1.5V. 
2. The clock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 
. This parameter is guaranteed by design. 
With PliOn asserted, Reset must be asserted for the longer of 3000 clock cycles ‘or 200 microseconds: 
. Tcyc is one CPU clock cycle (two cycles of a 2x clock). 
No two signals on a given device will derate for a given load by a difference greater than 15%. 
Clock transition time < 2.5ns for 33MHz; clock transition time < 5ns for all other speeds. 
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AC ELECTRICAL CHARACTERISTICS FOR IDT79R3010AE(: 3) 
COMMERCIAL TEMPERATURE RANGE (Tc = 0°C to +90°C, Vcc = +5.0V + 5%) 















Symbol 
Clock 


TckHigh | Input Clock HIGH(2) 
|TckLow | Input Clock LOW(2) 





ZiZz 







TCkP Input Clock Period 12.5 ns 
Clk2xSys to CIk2XSmp(°) 0 ns 
Clk2xSmp to Clk2xRd°) 0 ns 

D to Clk2xPhi(°) 3.0 ns 








Timing Paramters 


roade B5pE 
FTos [Data Setup ——SS—S 
[DataHold®) 


(3 
( 
Data Hold(9) 


TFpCond} Fp Condition 
TFpBusy | Fo Busy 


Fp Interrupt 


Fp Move To 
RExS | Exception Set-up (Run Cycle) | 
SExS | Exception Set-up (Stall Cycle) 


xH Exception Hold 


[TRuns 





| 
mv 
© 
=) 
wn 







! 
C}n 
S15 
nln 


3 
ee) 


O 
< 
» 


=) 
wn 


=) 
ies) 


Tos 
TDH 


— 
o 
3 jd 
mn 1” 











dja 
nm fm 
no} me) 
=is 
So |= 
< 


— 
Oo) 
=) 
” 


— 
93 jf J 
nO1n 


=) 
n 


m 


Run Hold | 
Stall Set-up 
| Stall Hold | 


| Reset Initialization 


Reset Timing, Phase-lock on(* °) 
| Reset Timing, Phase-lock off(®) 


Capacitive Load Deration 


=) 
n 


= 
Yn 


| ae | o no] 
a & 
a | 
op) 


+] 4] 
n,m ~D 
ND) on Iz 
a ae 
TIwW JL 
=) 
” 


a) 
=) 
” 


3000 
128 


Tceyc 





aya 
= = 
n n 
oe _- 
vU 
cr 
Cc 











NOTES: 2873 tbl 14 
1. All timings are referenced to 1.5V. 

2. The clock parameters apply to all four 2xClocks: Clk2xSys, Clk2xSmp, Clk2xRd, and Clk2xPhi. 

3. This parameter is guaranteed by design. 

4. With PllOn asserted, Reset must be asserted for the longer of 3000 clock cycles or 200 microseconds. 

5. Tcyc is one CPU clock cycle (two cycles of a 2x clock). 

6. No two signals on a given device will derate for a given load by a difference greater than 15%. 

7. Clock transition time < 2.5ns. 
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Teklow —— Tckp - Tekhigh 


Clk2xSys 
Tsmp 


olk2xSmp 


a 


Clik2xRd 


Clk2xPhi 
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Figure 6. Input “2x” Clock Timing 


FpSysOut 
FpSmpOut* 
FpRdOut* 


FpPhiOut* 
Tsys 
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Figure 7. Processor Reference Clock 


* These signals are not actually output from the floating point processor. 
They are drawn to provide a reference for other timing diagrams. — ; ; 
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FPA Store FPA Load 
Phase : 1 
FoSysOut 
FpPhiOut 
Aun ie / 
Fee te See, aa 
Trunh TstallS 
2873 drw 12 
Figure 8. Floating Point Load/Store Timing 
MoveTo MEM Access MoveTo Writeback 

Phase 
FpSysOut 
FpPhiOut 
FpCond 


Tfpmov | Tfomov 
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Figure 9. Move to FPC Status Timing 
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Phase { 
FpSysOut 
Fi Ka > 
| 
Tfpint Tfpint 2873 drw 14 
Figure 10. Floating Point Interrupt Timing 
FPCompareMEM 
Phase 1 
FpSysOut Tsys Tsys 
FpPhiOut 


recon <a, 
: Tfpcond Tfpcond 
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Figure 11. Floating Point Condition Timing 


6.1 . 18 


IDT79R3010A/AE 
RISC FLOATING POINT ACCELERATOR COMMERCIAL TEMPERATURE RANGE 


Phase 
“™ 
FpPhiOut 
Td 
FpBusy er NF PN es 


res aes 





Tsys 






Exception g 
——— 


TRExS -TEXH tile —— 
Bun <—>| TStallH 
2873 drw 16 
Figure 12. Floating Point Busy, Exception Timing 
Phase 1 2 1 2 1 2 1 2 At 2 





Pion aay i 


Reset 


Vec 
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Figure 13. Power-On Reset Timing 
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ORDERING INFORMATION 





IDT XXXXX — XX X X 
Device Type Speed —_ Package Process/ 
Temperature 
Range 





VALID COMBINATIONS 
IDT 79R3010A - 16, 20 All packages 
79R3010AE - 25, 33, 40 G 


6.1 


= Blank 


79R3010A 
79R3010AE 


COMMERCIAL TEMPERATURE RANGE 


_ Commercial (0°C to +70°C) 


84-Pin Quad Flatpack (Cavity Down) 
84-Pin PGA (Cavity Down) 
84-Pin J-Bend CerPack (Cavity Up) 


16.67 MHz 
20.0 MHz 

25.0 MHz © 
33.33 MHz 


40 MHz 


Floating Point Accelerator 
Enhanced Timing Floating Point Accelerator 
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BiCameral™ CacheRAM™ PRELIMINARY 
288K (16K x 9 x 2) IDT71B229S © 
FOR RISC CACHES 


Integrated Device Technology, Inc. 





FEATURES: DESCRIPTION: 
¢ Supports the R3000 and R3001 to 40MHz The IDT71B229 is a BiCameral CacheRAM specifically 
¢ BiCameral organization: designed to support the split instruction and data caches of the 
— Split instruction/data cache support IDT79R3000 microprocessor. A complete 128KB cache for 
— No bank-switching timing contention the R3000 or the R3500 can be built with only six to seven 
* Single address bus IDT71B229s (depending on the main memory size supported 
¢ Single data bus by the system), while an R3001 cache can be built with five to 
* Separate write enable and output enable for each bank six parts. CPU clock frequencies up to 40MHz are supported. 
¢ Standard read and write control interface The small 300mil package allows a 128KB cache to fit in a 
¢ Internal address latches circuit board area of approximately two square inches. 
¢ 32-pin 300mil SOU package Internal address latches eliminate the need for external 


latches. The BiCameral (two bank) organization reduces the 
number of devices required to support the R3000's split-cache 
architecture and eliminates contention problems encountered 
when one RAM bank is being enabled while the other is being 
disabled. All timing parameters have been optimized to sup- 
port the complete range of 3000 clock speeds, simplifying 
R3000 cache design. 

The IDT71B229 provides dense caches in low board 
space while consuming minimum power. 
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PIN CONFIGURATIONS 





A5 A4 
Aé AB, 
DWE WE 
AT VO4- 
VOs oa 
VOs. 1 Oz 
_ Vec GND 
GND Vcc 
vor Vor 
VOs. Oo 
As A2 
Ag AL 
Ato Ao 
Att Ai3 — 
Ai2 DCLK 
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TRUTH TABLE 1 












Function 
| 
(ou. tBank | __—Read Bankoste 
[—Hign2 | Write asta Bank 





High-Z- . . Not Allowed 
| High-Z Not Allowed 


| High-Z Not Allowed 


High-Z Not Allowed 





High-Z - Not Allowed | 
Not Allowed 
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High-Z 





: af: o 
TEEEEFEE TT ; 





TRUTH TABLE 2") | - 


| 






NOTE: 2996 tbi 02 
1. L = LOW, H = HIGH, X = Don't Care, = Unrelated, High-Z = High- 
Impedance 
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PIN DESCRIPTION 


ee 


DCLK, when HIGH, allows the address inputs to flow through the D bank's address latch. Conversely, the address in 
the | bank's latch is held during a HIGH input on DCLK. Taking DCLK LOW freezes data in the D bank's address 
latch and allows addresses to flow through the | bank's address latch. 


| Output Enable enables the data outputs from the | bank onto the data input/output pins. IOE must not be asserted 
simultanteously with the DOE, DWE or IWE pins. 


This is an input which enables the data outputs from the D bank onto the data input/output pins. DOE must not be - 
asserted simultanteously with the IOE, IWE or DWE pins. : ; 




















| Write Enable, when LOW, gates data from the input/output pins into the RAM at the | bank address indicated by the 
output of the | bank address latch. Neither DOE nor IOE should be enabled during a write operation. 







D Write Enable is an input which is taken LOW to gate data from the input/output pins onto the RAM at the address 
being output from the D bank address latch. Neither DOE or IOE should be asserted during a write operation. 











The fourteen address inputs are used to access any of the 16,384 locations in either the D or | bank. When an 
address latch is in the transparent state, these pins are routed directly to that latch's RAM bank. Taking the latch into 
its latched state causes that RAM bank to ignore subsequent changes on the address input pins. 












CAPACITANCE (Ta = +25°C, f = 1.0MHz) ABSOLUTE MAXIMUM RATINGS") 


Symbol] Parameter | Conaitions | Max[ Unt] [Symbot | Rating | Commercial | unit _| 
Input Capacitance . Terminal Voltage 0.5 to +7.0(2) V 
Pe with Respect | 
to GND 
NOTE: 2996 tbl 04 io 


F ° 
1. This parameter is determined by device characterization, but is not : perating Oto +70 C 
production tested. emperature 
TBIAS Temperature —-55 to +125 °C 
Under Bias | 
TsTG Storage -55 to +125 °C 
| Temperature 


|Pr__| PowerDissipation | 1.0 |W 
lout DC Output 50 mA 
Current 


NOTES: - “2996 thi 06 
1 


The input/output bus comprises nine signals whose functions are determined by the state of the IOE, IWE, DOE and 
DWE pins. During Output Enables, data is output upon these pins from the selected RAM bank from an address 

pointed to by the outputs of that bank's address latch. When either Write Enable is asserted, data can be written fro 
these pins into the selected bank's RAM at the address being output by that bank's address latch. When JOE, IWE, 
DOE and DWE are all inactive, the input/output pins are floated in a high-impedance state. 
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OU 
nN 











RECOMMENDED DC OPERATING 
CONDITIONS 


Symbol[ Parameter Min. | Typ, 
Supply Voltage | 4.75 | 5.0 | 5.25 






Supply Voltage 30k | : : ‘ - 
. Stresses greater than those listed under ABSOLUTE. MAXIMUM 
Input HIGH Voltage RATINGS may cause. permanent damage to the device. This is a stress 





Inout LOW Voltage | -0 rating only and functional operation of the device at these or any other 
; conditions above those indicated in the operational sections of this 


NOTE: — 2996 tol 05 specification is not implied. Exposure to absolute maximum rating 
1. Vit (min.) = —3.0V for pulse width less than 20ns. conditions for extended periods may affect reliability. 
| 2. VTERM must not exceed Vcc+0.5V. 








IDT71B229S 
BiCMOS STATIC RAM 288K (16K x 9x 2) BiCameral CacheRAM 
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DC ELECTRICAL CHARACTERISTICS'” 2) 
(Vcc = 5.0V + 5%) 








| | ee a 7 718229812 | 718229816 | 71B229S22 | 71B229S28 
Symbol _ | Parameter | 7 7 | Com’. | Co Com’l. | Com’! | 
Icc1 Operating Power Supply Current ; 190 | 1 170 160 
| os {| Outputs Open, Vcc = Max., f = 0 | “4 


— 250 





Dynamic Operating Current 
Outputs Open, Vcc = Max., f = fMAx 


Icc2 


NOTES: 
1. All values are maximum guaranteed values. 
2. fuax=1/tcyc with address bits cycling at maximum frequency. 


f = 0. means no inputs change. 





m’l. 
= ta te | 
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DC ELECTRICAL CHARACTERISTICS OVER THE : 










Input Leakage Current 


Symbol 


Output Low Voltage - ' 
Output High Voltage Z 





ACCESSTIME AND a 
CLOCK FREQUENCY EQUIVALENTS 


| 3000/1 Clock Frequency | 71B229 Access Time | 
| _33 MHz fee OM 










| 20 MHz . 


28 ns . 
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RECOMMENDED OPERATING 
TEMPERATURE AND SUPPLY VOLTAGE 

| Grade | Ambient Temperature 
vas | 














nae 
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OPERATING TEMERATURE AND SUPPLY VOLTAGE (Vcc = 5.0V + 5%) 


Test Condition 
Vcc = Max., VIN = GND to Vcc 


Output Leakage Current Vcc = Max., Vout = GND to Vcc a a 





1OL = 8MA, Vcc = Min. 





IOH = -4mA, Vcc = Min. 
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AC TEST CONDITIONS | 


Input Pulse Levels a 
Input Rise/Fall Times | | _ 5ns 
Input Timing Reference Levels 1.5V | 





GND to 3.0V 


Output Reference Levels 1.5V 
| Output Load See Figures 1A, 1B. 1C 
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+1.5 Volts ig 
500 6 
DATAOouT 5 
= = ATAA 4 
e 7 (Typical, ns) 
2996 drw 03 3 
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Figure 1A. AC Test Load 


sh 


20 40 60 80 100 120 140 160 180 200 
CAPACITANCE (pF) 
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5V 


Figure 1C. Lumped Capacitive Load, Typical Derating Curve 


71B229 
| Addr(13:0) 
/0(8:0) 


—- LEAR AREA 
Buses 
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4800 


DATAOUT 


2550 5pF* 


2996 drw 04 
*Includes scope and jig. 


Figure 1B. AC Test Load (for toLz & toHz) 





79R3000 


AdrLo (15:2) 


|Rd* 





NOTE: 
1. Loading of the IRd, |Wr, DRd and DWr signals should be split evenly between the pair of R3000 pins dedicated to each of these functions. 


Figure 2. Example of Cache Memory System Block Diagram 
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AC ELECTRICAL CHARACTERISTICS (Vcc = 5.0V + 5%, All Temperature Ranges) 
[riszzasta] rieaassie]| Tiee20sz2| 716220508 | 

mon! remem ae We [ie [| 
Read Cycle 2 . | | 
five | Reade tne” SSCS YT 
Phases Seuptine oe Po 
Padres Hol Tine 

i a 

[ouput enabetine Sd 
fea | oaparDsanetoOapainnenz de |S 


NOTES: | 2996 tbl 12 
1. One cycle includes both a D bank read or write and an | bank read or write. . 
2. This parameter is guaranteed with the AC test load, Figure 1B,due to device characterization, but is not production tested. 
















f 





TIMING WAVEFORM OF READ CYCLES") 


tcYyc | 
tsu tH 
DCLK 
tAA ~~ tAA 


DOE | | = 
1OE | 

toHz') 
: ee ee ae 


NOTES: , 
1. DWE and IWE must be high during read cycles. 
2. The transition is measured +200mV from steady state with load in Figure 1B. 
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AC ELECTRICAL CHARACTERISTICS (Vcc =5.0V + 5%, All Temperature Ranges) 


71B229S12 | 71B229S16 | 71B229S22 | 71B229S28 
| Symbol | Parameter | Min. | | 


Write Cycle 
| tcyc 
Address Hold Time 
Address to End-of-Write 


Address to Start-of-Write 


Write Pulse Width 
Data to Write Time Overlap | 5.5 
Data Hold from Write Time 


NOTE: 2996 tbl 13 
1. One cycle includes both a D bank read or write and an bank read or write. 





re 
las 
ow 


TIMING WAVEFORM OF WRITE CYCLES" 2) 


tcYCc 


1SU tH | 
DCLK 


tAS 


TAS 


tAW tWR 


DWE 
twP twp 
IWE 


(iam 


OTES: 2996 drw 08 
DOE and IOE are high during write cycles. 


DWE must be high or DCLK must be low during all address transitions. Likewise, IWE or DCLK must be high during all address transitions. 






/O 





NS = 
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TIMING WAVEFORM OF MIXED READ AND WRITE CYCLES" 2: 3) 


tcyc 


Address D Address . a. | Address KY D Address _ 


tsu tH 


DCLK 






tWR 


wwe : | 
tAS twe —— 
OE | 
re | aia toHz 4) 
/O 


| Data Out D Data In | K | Dataout > Out 


NOTES: 2996 drw 09 


1. ae and IOE are high during write cycles. 

2. DWE must be t be high or DCLK must be low during all address transitions. Likewise, IWE or DCLK must be high during all address transitions. 
3. owe and IWE must be high during read cycles. 

4. The transition is measured #200mV from steady state with load in figure 1B. 


ORDERING INFORMATION 











IDT XXXX X XX X X 
Device Power Speed Package Process/ 
Type Temperature 
Range 
ail Blank Commercial (0°C to +70°C) 
) rane 300-mil SOU (SO32-2) 
12 
16 Access Time in nanoseconds 
22 
28 


S Standard Power 


71B229 16K x 9 x 2 BICEMOS SRAM 
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RISC CPU WRITE BUFFER IDT79R3020 





Integrated Device Technology, Inc. 





FEATURES: DESCRIPTION: 

¢ Temporary storage buffers to enhance the performance The IDT79R3020 Write Buffer enhances the performance 
of the IDT79R3000 RISC CPU processor of IDT79R3000 systems by allowing the processor to perform 

¢ Allows for write operations by the RISC CPU processor write operations during Run cycles instead of resorting 10 
during Run cycles time-consuming stall cycles. Each |DT79R3020 device handles 

¢ Each Write Buffer has four locations to handle an 8-bit an 8-bit slice of address, and a 9-bit slice of data (one parity 
address slice and a 9-bit data slice (including a parity bit) bit per byte); thus, four IDT79R3020s provide 4-deep buffer- 

« High-speed CMOS technology ing of 32 bits of address and 36 bits of data and parity. Figure 

¢ Pin, functionally and software compatible with the MIPS 1 illustrates the functional position of the Write Buffer in an 
Computer Systems R2020 Write Buffer IDT79R3000 system. 

* Speeds from 16.7 to 40MHz Whenever the processor performs a write operation, the 

- Also works with Intel i486™ for Writeback secondary Write Buffer captures the output data and its address (includ- 
cache ing the access type bits). The Write Buffer can hold up to four 


data-address sets while it waits to pass the data on to main 
memory. Transfers from the processor to the write buffers 
occur synchronously at the cycle rate of the processor and the 
write buffer signals the processor if it is unable to accept data. 
The write buffer also provides a set of handshake signals to 
communicate with a main memory controller and coordinate 
the transfer of write data to main memory. 

The sections that follow describe these IDT79R3020 Write 
Buffer interfaces: 
¢ the processor-Write Buffer interface 
¢ the Write Buffer-main memory interface 
* a miscellaneous, Write Buffer-board control interface. 





WRITE BUFFER 





a tas Address 
IDT79R3020 


Write Buffers Main 


Memory 


IDT79R3000 Controller 


Processor 


Control Control 
Signals Signals 


Dat 
[D31:00} » i B eS Write Data 


Read Data 
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Figure 1. The IDT79R3020 Write Buffer in an IDT79R3000 System 


The IDT logo is a registered trademark of Integrated Device Technology, Inc. 
i486 is a trademark of Intel Corporation. 
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IDT79R3020 
RISC CPU WRITE BUFFER 


WRITE BUFFER - IDT79R3000 PROCESSOR | 
INTERFACE 


Figure 2 shows the signals comprising the Write Buffer 
interface to the IDT79R3000 (all descriptions assume that four 
IDT79R3020 Write Buffers are used to implement a 32-bit, 
buffered interface). The AdrLo bus and Tag bus bits from the 
processor are both connected to the Write Buffer to form a 32- 


SysOut 


Address Bus 
(AdrLo & Tag) 


Data Bus 


IDT79R3000 and Parity © 


Processor 
. AccTypo 
AccTyp1 


MemWr 
| WrBusy 
CpCondd 
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bit physical address that is captured by the buffers. Thirty-two 
bits of data, four bits of parity, and two access type bits are also 


captured by the Write Buffer. The paragraphs that follow 


describe the Write Buffer-processor interface signals andthe 
timing of processor-to-Write Buffer data transfers. te 


Clock 


Addrin7:0 


Address 1 0... 


Dataln8:0 
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Figure 2. Write Buffer — IDT79R3000 Processor Interface 


Write Buffer-Processor Interface Signals | 


Clock 

An inverted version of the R3000s SysOut signal from the 
R3000 processor that synchronizes data transfers. The Write 
Buffer uses the trailing edge of Clock to latch the contents of 
the AdrLo bus and uses the leading Clock edge to latch the 
contents of the Data and Tag buses. 


Dataln8:0 
_ Nine input data lines from the IDT79R3000 processor's 
Data bus (eight bits of data and one bit of BaD): 


Addrin7:0 
Eight input address lines from the IDT79R3000 processor. 
The address lines are taken from the AdrLo and Tag buses. 


Address1:0 
The two least significant address bits from the IDT79R3000 
processor. These two address bits must be:connected to all 


four Write Buffers and are used in conjunction with the access 


type (AccTyp1:0) signals, the Position1:0 signals, and the 
BigEndian signal to determine which byte(s) in a word are 
being written into a particular Write Buffer. 


AccTypin1:0 

The access type signals from the IDT79R3000 processor 
specifying the size of a data access: word, tri-byte, half-word, 
or byte. 


WwtMem 


This input is connected to the MemWr signal from the 
IDT79R3000 processor that is asserted whenever the proces- 
sor is performing a store (write) operation. 


Request 

The primary purpose of this signal is to request access to 
memory and is described later when the Write Buffer-Main 
Memory Interface is discussed. The Request signal can also 
be connected to the CpCond0 input of the IDT79R3000 and 
can then be tested by software to determine if there is any data 





inthe Write Buffer. Since Request is deasserted if there is no 


data in the Write Buffer, software can determine if a. previous 


~. write operation (for example, to an I/O device) has been 


completed before initiating a read or read status operation 
from that device. 


WbFull 
The Write Buffer asserts this signal to the IDT79R3000s 





_ WrBusy input whenever it cannot accept any more data; that 


is, when the current write will fill the buffer or the buffer has all 
address-data pairs occupied. The IDT79R3000 processor 
performs a write-busy stall if it needs to store data while the 
WbFull/WrBusy signal is asserted. 





Data & Address Connections 
Figure 3 illustrates how four Write Buffers are connected to 
the address and data outputs of the IDT79R3000 processor. 
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IDT79R3020 
RISC CPU WRITE BUFFER 


Address Inputs 

Each Write Buffer device has eight address inputs (Adrln7:0). 
The four low-order bits (Adrln3:0) are clocked into the device 
on the trailing edge of the Clock signal and are taken from the 
IDT79R3000s AdrLo bus. The four high-order bits (Adrln7:4) 
are clocked into the device on the rising edge of the Clock 
signal and are taken from the IDT79R3000s Tag bus. 

Each device also has separate inputs (Address1, Ad- 
dressO) for the two low-order bits from the AdrLo bus. These 
bits must be input to each device since they comprise the byte 
pointer. Note in Figure 3 that the two low-order Adrin inputs 
(Adrin1:0) to Write Buffer device 0 are connected to ground 
since the Address1 , Address0 inputs already supply these bits 
to the device. 





Data & 
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Data Inputs 

Each Write Buffer device has nine data inputs that are 
clocked into the device on the leading edge of the Clock signal 
and are taken from the IDT79R3000s Data bus. In Figure 3, 
each device captures eight bits of data and one bit of parity. 
Also note that the data bits assigned to each device corre- 
spond to the address bits connected to the device. This 
arrangement Is required since data selection is dependent on 
a combination of the AccType signals and the two low order 
address bits. The arrangement also simplifies system utiliza- 
tion of the “Read Error Address” feature described later. 

The Position? and PositionO signals shown in Figure 3 
specify the nibble position within a halfword that each write 
buffer device comprises. 













Parity Data Bus [35:00] 
Read 
Tag eee Buffer 
i eee ene <G eS 
AdrLo 
Reha 
Tag 31:28 Adr|n7:4 
(Pesca a 
ne <a nes ae 
Data35, 31:28 Dataln8:4 
Data 15:12 | Dataln3:0 Position 1 ) 
ae il ' Position 0 













Tag 27:24 
AdrLo 11:08 


Data33, 27:24 
Data 11:08 


Tag 23:20 | 


AdrLo 07:04 


a <a 


Data34, 23:20 


Tag 07:04 

























Data03:00 


Tag 19:16 Adrln7:4 
te 
Write 
AdrLo 03:02 é 
Adrin3:2 Buffer 
Adrin1:0 0 
* 
| Data32, 19:16 | Datalns:4 | 
Dataln3:0 Position oO “Q” 


Address1:0 
Adrin7:4 
Adr|In3:0 
Dataln8:4 


Dataln3:0 
Address1:0 


Adr|n7:4 
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Figure 3. Write Buffer Data and Address Line Connections 
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Write Buffer - Processor Timing a . - 
Transfers between: the processor and the Write Buffers When the WrtMem signal is asserted, the low-order ad- 
occur synchronously: the Clock signal from the processor is dress bits, and the Address 1:0 inputs, are latched on the 
input to the Write Buffers and used to clock the address and _ trailing edge of the Clock signal (1). The rising edge of Clock 
data information into the Write Buffers’ latches. Figure 4 (2)isusedtolatch the high-order address bits, the etcees Re 
illustrates the timing for the processor-Write Buffer interface. inputs and the contents of the data bus. | 


Clock | . 
WtMem ih 
Adrin3:0 
= tH 
Address 1:0 Bé) 


eee ee 
Adrin 7:4 ——,; 
(Tag) 
ek ——p 


Figure 4. Processor — Write Buffer Interface Timing 
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WRITE BUFFER - MAIN MEMORY INTERFACE 

Figure 5 shows the signals comprising the Write Buffer Clock signal is required, the handshaking signals in this 
interface to main memory. This interface is essentially interface have no direct connection to the operation of the 
decoupled from the Write Buffer-processor interface: although Write Buffer-processor interface. 
some synchronization of the memory interface signals and the 


‘ 


OutEn 


AddrOut 


AccTyp0O 
AccTyp1 Main 


Memory 
DataOut Controller 
and Parity 


Write Buffer (x 4) 


Request 


Acknowledge 
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Figure 5. Write Buffer — Main Memory interface 
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Write Buffer - Main Memory Interface Signals 
Each Write Buffer provides the following signals that com- 
prise the interface to a main memcry controller: 


AddrOut 7:0 
Eight address line output from each Write Buffer. 


DataOut 8:0 
Nine data lines from each Write Buffer (eight bits of data and 
one bit of parity). 


AccTypOut 1:0 
The access type signals from the Write Buffer specifying 
the size of a data access: word, tri-byte, half-word, or byte. 


OutEn 

The memory controller asserts this write input to enable the 
tri-state outputs of the IDT79R3020 address data signals, and 
AccTyp Out. 


Request 
The Write Buffer asserts this signal to inform the main 
memory system that it has data to be written to memory. 


Acknowledge | 

The main memory system asserts this signal when it has 
captured the data presented by the Write Buffer on the 
DataOut lines. 


Clock 
Request * aes a ae 


Acknowledge 
OutEn 


AddrOut 
DataOut 
AccTyp Out 
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Write Buffer - Main Memory Interface Timing 

Figure 6 illustrates the timing for the transfer of data from 
the Write Buffer to the main memory system. The sequence 
illustrated in this figure is as follows: 


1. When the Write Buffer has a data-address pair for 
transfer to the memory system, it asserts the Request 
signal. 

2. When memory system is ready to handle the Write 
Buffer data, it asserts the OutEn signal to enable the 
Write Buffers’ address and data outputs onto the system 
buses. 

3. When memory system no longer requires the Write 
Buffer address and data outputs, it asserts the Acknowl- 
edge signal. 

The Write Buffer responds to this signal by discarding 
the address-data pair that was just output. 

4. The memory system can deassert the OutEn signal to 
return the Write Buffers’ address and data outputs to 
their tri-state condition. 

5. Since the Request signal remains asserted, the memory 
system asserts the OutEn signal again to enable the 
next address-data pair onto the system buses. 

6. When memory system has accepted the second ad- 
dress-data pair, it again asserts the Acknowledge signal. 
lf the Write Buffer is now empty, it responds to this signal 
by deasserting the Request signal. 
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Figure 6. Write Buffer — Main Memory Interface Timing 


Note that the buffer’s interface to main memory |s not 
completely asynchronous: assertion of the Request signal by 
the Write Buffer is synchronized with the rising edge of Clock, 
and the Acknowledge signal input by main memory has a 
minimum set up and hold time in relation to the Clock signal. 


MISCELLANEOUS WRITE BUFFER - BOARD 
LOGIC INTERFACE 


The Write Buffers support several functions that utilize 
signals that do not fit neatly into the descriptions of either the 
processor or main memory interfaces. These functions and 
signals typically involve miscellaneous logic on a CPU board 
and include the following: 
¢ byte gathering 
* configuration connections (Big Endian, Position 1:0) 

* address matching logic 


* error address latch logic 
The sections that follow describe each of these categories. 


Byte Gathering 


The Write Buffers perform byte (half-word, tri-byte and 
word) gathering to decrease the number of write transfers to 
same location; that is, sequential writes to the same WORD 
address have their data combined into the same address-data 
pair buffer. 

Byte gathering is prohibited in the address-data pair that is 
currently available to the memory controller. Thus, the first 
write into an empty Write Buffer will not have subsequent 
writes gathered into it because it is currently available for 
output to memory. Writes to the same location (byte) may be 
overwritten in the Write Buffer if the gathering is not prohibited 
by the preceding rule. 








IDT79R3020 
RISC CPU. WRITE BUFFER 


The Write Buffers present address-data pairs to the main 
memory controller in the sequence in which they were re- 
ceived from the processor exceptin the case of gathered data, 
where bytes or half words canbe collected and written to main 
memory in a single write operation. If the address-data pair 
buffer is scheduled to be output, then gathering is inhibited 
and the buffer contents are presented to the main memory 
controller. Subsequent writes are then placed in another 
buffer. No reliance should be placed in any aspect of gathering 
(except that it only involves sequential writes to the same word 
address) as it is not readily deterministic. Non-sequential 
writes to the same word address are not gathered. 

Note that gathering can. require that two main memory 
controller references be used to empty.a single Write Buffer 
entry. For example, this can occur if Bytes 0 and 3 of a word 
are sequentially written. Where order in writing is important, 
such as in I/O controllers, software should avoid sequential 
accesses to the same word. In cases where write-read access 
ordering is important but reading of the write location is not 
desired, such as during I/O, then a write followed by a write to 
a dummy location followed by a read of the dummy location 
will insure the first write has occurred before continuing. 
Alternatively, the Request signal can be tested to determine 
that the Write Buffer is empty. 


Access 
a Uaee.. 


Poet |? fe 
ile byte) \ ac : 


fala 
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Configuration Logic Connections 

Because of their byte gathering capability, each buffer 
device internally maintains a record of each valid byte in an 
address/data pair. To do this, each device must have a way of 
determining which data bits within a word it is handling. The 
following signals determine how the write buffers handle data 
that is written to the devices: 
* Position 1, Position 0 - these signals (in conjunction 
with Big Endian) determine how each Write Buffer 
decodes the Address 1/0 and AccType 1/0 to determine if 
it should store the data inputs. Refer to Figure 3 for an 
illustration of how data bits are assigned to Write Buffer © 
devices based on their position. 
Big Endian - When asserted, byte 0 is the leftmost, most 
significant byte (big-endian): when deasserted, byte 0 is 
the rightmost, least-significant byte (little-endian). 
Address 1, Address 0 - these signals (taken from the 
AdrLo bus) must be connected to all buffer devices since 
they determine which byte within a word is being 
accessed. 
AccType 1, AccType 0 - these inputs signals specify the 
data size of a write operation as shown in Table 1. 


e 


Table 1 shows how these signals operate to specify how 
bytes are saved within the Write Buffers. 


Bytes Accessed 


___Big-Endian- 0137___Little-Endian 
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Table 1. Byte Specifications for Write Operations 


The lower two address bits of the device in position zero 
(as determined by the two POSITION inputs) are inhibited; 
that is, they are not stored directly as they are output on the 
AdrLo bus. Instead, on output, the lower two address bits are 
generated from the indication of the positions of the valid data 
bytes as determined by above table. 


MatchOut/Matchin Logic and Read Conflicts 
Whenever the processor references main memory (either 

a write or a read reference), the Write Buffers compare the 

word address from the CPU with the word addresses stored 


in the buffers. If any word address matches, the buffers assert 
signals that can be used by the main memory controller to 
ensure that the Write Buffer is emptied before the read access 
with the conflicting address has been performed.. | 

Figure 7 illustrates the Write Buffer signals involved in 
address comparison logic. Each write buffer provides four 
output signals (MatchOut A, B, C, and D) which correspond to 
the four buffer ranks (A, B, C, D) in each device as shown in 
Figure 1. These MatchOut signals can be externally NANDed 
as shown in Figure 7 to determine if the address being input 
matches those in any rank of the Write Buffer. 





IDT79R3020 
RISC CPU WRITE BUFFER 


Write Buffer 3 


MatchinA MatchOutA 
MatchOutB 
MatchOutC 
MatchOutD 


MatchInC 
MatchinD 


Write Buffer 2 


MatchIinA MatchOutA 
MatchInB MatchOutB 
MatchOutC 
MatchOutD 


Write Buffer 1 


MatchInA MatchOutA 
MatchInB MatchOutB 
MatchinC MatchOutC 
Match!nD MatchOutD 


Write Buffer 0 


MatchInA 
MatchInB 


MatchOutA 
MatchOutB 
MatchOutC 
MatchOutD 


Figure 7. Write Buffer MatchOut/Matchin Logic 


The outputs of the NAND gates are fed into Write Buffers via 
the Matchin A, B, C, and D signals and are used within each 
device as part of the byte gathering logic. The NAND gate 
outputs can be NANDed together as shown in Figure 7 with 
the resultant signal used (in conjunction with the processor’s 
MEMRD signal) to alert the main memory controller logic that 
there is a pending buffered write that conflicts with a just- 
issued read. The main memory controller can then delay the 
read access until the Request signal is deasserted indicating 
that the Write Buffer has been emptied. 


AddresslIn 


LatchErrAddr 
EnErrAddr 


Dataln 


OutEn 









: Error Address 
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MatchInA 


MatchInB 
@ 


MatchinC 
CO) 


MatchInD 
@ 


To Main Memory 
Controller 


CONFLICT 


5002 drw 08 


Error Address Latch 

_ The write buffer incorporates an internal latch that can be 
loaded with one of the buffered addresses and subsequently 
enabled out onto the data lines. This feature can be used by 
error handling routines to read an address back from the Write 
Buffer and analyze or recover from certain bus errors. Figure 
8 shows the signals involved in operation of this latch. 

When the LatchErrAddr signal is asserted, the address 

currently available to the address outputs of the Write Buffer 
is latched into the internal latch. This address can then be 
output on the DataOut lines by asserting the EnErrAdr signal 
so that the processor can read the address in as data. Refer 
to the AC specifications for timing parameters of the signals 
associated with the error address latch. 


AddressOut 








Latch 





DataOut 
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Figure 8. The Write Buffer Error Address Latch 
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ABSOLUTE MAXIMUM RATINGS": 3) __ RECOMMENDED OPERATING 





TEMPERATURE AND SUPPLY VOLTAGE 


| AMBIENT | 
TEMPERATURE Vcc 
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VTERM Terminal Voltage 
| with Respect to GND —0.5 to +7.0 
TA Operating | 
| Temperature | Oto +70 ° | 
TBIAS Temperature 
| Under Bias —§5 to +125 e 
TsTG Storage 
| | Temperature®) —55 to +125 : 


Cc 
Input Voltage —0.5 to +7.0 lt SAE i 
NOTES: 5002 tbl 01 
1. Stresses greater than those listed under ABSOLUTE MAXIMUM RAT- 
INGS may cause permanent damage to the device. This is.a stress rating 


SYMBOL RATING 


V 

C : 
Cc : 
a OUTPUT LOADING FOR AC TESTING 
7 | | 


+4mA 








only and functional operation of the device at these or any other conditions WEE » To Device 
above those indicated in the operational sections of this specification is not + BV EDF Under Test 
implied. Exposure to absolute maximum rating conditions for extended a 7 P 

periods may affect reliability. 7 Ranh: “= 


2. VIN minimum = —3.0V for pulse width less than 15ns. VIN should not ex- 5002 drw 10 


ceed Vcc + 0.5V. 
3. Notmore than one output should be shorted ata time. Duration of the short 
should not exceed 30 seconds. 


DC ELECTRICAL CHARACTERISTICS — 
COMMERCIAL TEMPERATURE RANGE (1a = 0°C to +70°C, Vcc = +5.0 V + 5%) 


| ~-146.67MHz | 20.0MHz 40MHz 
SYMBOL PARAMETER TEST CONDITIONS |Min. Max.|Min. Max. Min. Max.| UNIT 


1 
Vou S518 
.___| Output LOW Voltage | Voc= Min. loL=4mA_ | — 0.4 | — 0.4 | i. 04 | 
fvin_ | inputHiGH Voltage) | it — [24 —~ [24 — [24 — 24 —] v_ 










O 5 
oO en 
IH saad 
lL z pas OR 
IN| Input Capacitance(®) | 10 
[Cour | OutputCapacitancel [| 10 I 
floc | Operating Current [Voc=Max. | — 50 
fii | Input HIGH Leakage | Vi! = Voo al ee 
Vit= GND |-10_ — 


: 5 
7 
V (1) | | 
[Vu | InputLow Voltage | 
et Saree i: 
| 0 

0 

10 | — 

fioz | Output Tri-state Leakage -40 40 


z 


omen 
NOTES: : 


1. ViIH should be held above Vcc + 0.5V. 
2. ViL Min. = —3.0V for pulse width less than 15ns. ViL should not fall below —0.5V for longer periods. 
3. Tested only initially, and after design changes which may affect capitance. 
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AC ELECTRICAL CHARACTERISTICS — 
COMMERCIAL TEMPERATURE RANGE (Ta = 0°C to +70°C, Vcc = +5.0V + 5%) 


16.67MHz | 20.0MHz 40MHz 
PARAMETER Min. Max. |Min. Max. 
a ae 


SYMBOL Min. Max.) UNIT 
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| 
mops rN 
Lit yl | 


+ 
— 
“” 


Addrin (3:0) to Clock falling setup 


Addrin (3:0) from Clock falling hold = 
Address 1:0 to Clock falling setup eh 

| Address 1:0 from Clock falling hold 
Access Type 1:0 to Clock rising setup ie, ert ee aed 
Access Type 1:0 from Clock rising hold 


Addrin (7:4) to Clock rising setup 

Addrin (7:4) from Clock rising hold 
Dataln (8:0) to Clock rising setup 7 —-|]|5 — 
Dataln (8:0) from Clock rising hold 3 —-;]3 — 
WrtMem to Clock rising setup 


WrtMem from Clock rising hold oe ee 4 


Request from Clock rising — 32 /]— 30] — 
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| 
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mM fA fw fw 
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| 
pm] 
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= 
= | N 
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| | | 
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| | | 
fe fe fe fe de a fe fe fe fe ee fe fa fe fea fe fe ea fs fe fe 


nh 
ne) 
| 
— 
oO) 
| 
4 
oO) 
=) 
” 


Acknowledge to Clock rising setup 12 — |11 — ns 


Eee 
Acknowledge from Clock rising hold [7 —|e —|[5 —| 
LatchErrAdr to Acknowledge rising fs = | 6 S16 
| WbFull active from Clock rising pa et fies. 1 a7" 


ey) 

| 
Go | oo 
||| 


ns 








18 | WHFUTI inactive from Clock rising }— a1 }— 19° 


119 OutEn to AddrOut (7:0), DataQut (8:0) valid 2 15 


t20 | OutEn to AddrOut (7:0), DataOut (8:0) tri-state f2 15] 2 16 | 2 15 | 


t21 MatchOut (ABCD) from Clock rising 
t22 Matchin (ABCD) to Clock rising setup 10 — 


123 | Matchin (ABCD) from Clock rising hold a es 


t24 | EntrrAdr to Data (error latch) valid 2 15 


t25 EnErrAdr to Data (error latch) tri-state 2 15 |] 2 15 


t26 Address/Data out from Clock rising — 
3 = H 
— 22 1 | 
— 20 
— 28 
— 21 
— 32 
— 32 


NO ie) 
| 
Nh Th 
—h — 
NM TM 


12 
12 


ns 
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ns 


ns 
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uo 
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nn 


NO 7h 
= joa 
ory mM 


ns 


nN 


15 ns 


a 
Oo 
| 
—_ 
(ep) 


ns 





=) 
” 





at 10 os 
t27 Reset to Clock rising, set-up cee ae eee 
2 

2 


” 


t28 Reset from Clock rising, hold 


° 

< 
Q 
0) 
” 


t29 Reset LOW pulse width 





| = 
= | 
Nh Ss —_ 
oo Ro a | 
n 


1380 
131 


WbFull HIGH from Clock rising (after Reset) 


ras 
ne 
— 2 [— «1 |— 
[=z [= 26 [= 25 |— » |[— 
i 
i 
=e [= «|= #[- = |— 
[a9 2000 


132 Access TypOut 1:0 from Reset LOW OutEn Asserted ns 
133 MatchOut (ABCD) LOW from Reset LOW 


t34 Address/Data out tri-state from Reset LOW 
(OutEn negated) 


n 


wn 


po f= 
Qiao 


” 


tcyc | Clock Pulse Width 60 2000] 50 2000] 40 2000] 30 2000] 25 2000 


| tckhigh | Clock HIGH Pulse Width oa Bee. ee We ae — 
tcklow _| Clock LOW Pulse Width 4 —|20 —|]16 —|12 —]10 —]n 


Access TypeOut from Clock rising 


ah 
— 
© 

nN 

Gd 


gi 
oO 
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i) 
—- 
fox 
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Clock 


_ WIMEM 


_ Adrin3:0 
(AdrLo) 


Address 1:0 
AccType 1:0 


Adrin 7:4 
(Tag) 


Dataln 


_ REQUEST | 
t15 


Acknowledge 


LatchErrAdr 
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a Figure 9. Write Buffer Timing Specifications 


Clock 


WTMEM 


REQUEST 


WbFull 
t17 t18 


| Acknowledge 
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Figure 10. WBFULL Signal Timing Specificaions 
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Figure 11. OUTEN Timing Specifications 


Clock 


WTMEM 
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(ABCD) 
t21 


Matchin 
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t22 t23 


EnErrAdr 





DataOut 


Error Latch Data Out 
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Figure 12. Match and Error Latch Timing Specifications 
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Address/Data as 


REQUEST 


Acknowledge - 


AccTypOut eal 
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Figure 13. Address/Data Out, Access Type Out 
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tCYC 
Clock fo | tCKHIGHN ae 
t29 ji ese 
Reset , t27| t28 
OutEn 
Addr/Data | 
Out |) 
MatchOut . | _\ 
AccessTypOut aay 
(0:1) a 
Request ay | 
130 | | 
ss bei 
WBFull Saar ae ae 5002 diw 16 
Figure 14. Reset Timing 
Reset 


OutEn 
. <+—> | 32 
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Figure 15. Reset Timing for Access Type Out 
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68-PIN CPGA FOR 3020 
PIN GRID ARRAY (CERAMIC) — BOTTOM VIEW 


ACC: AC- ADD- CLOCK DATA- DATA- DATA- DATA- VCC2 
TYPO KNOW- | RESS1 INO IN2 IN4 ING 
LEDGE 
GND1 VCC1 ACC- ADD- BIG- DATA- DATA- DATA- GND2 DATA- | 
TYPE1 RESSO | ENDIAN IN1 IN3 IN5 IN7 
ADDR- ADDR- DATA- ADDR- 
OUT5 OUT4 IN8 INO | 
ADDR- ADDR- ADDR- ADDR- 
OUT3 OUT2 IN1 IN2 
ADDR- ADDR- ADDR- ADDR- 
OUT1 OUTO IN3 IN4 
DATA- DATA- ADDR- ADDR- 
OUT8 OUTO IN5 ING 
DATA- DATA- ADDR- | LATCH- 
OUT1 OUT2 IN7 ERR- 
ADR 
DATA- ADDR- MATCH- | MATCH- | 
OUT3 OUT6 INA INB 
ADDR- ACC- MATCH- | MATCH- 
OUT7 TYPE INC IND 
OUT1 
ACC- GNDO DATA- DATA- RE- MATCH- | MATCH- | RESET POSI- VCC3 GND3 
TYPE OUT7 OUT4 QUEST OUTC OUTA TIONO 
OUTO 
VCCO DATA- DATA- | WB-FULL | MATCH- | MATCH- | WIMEM | POSI- OUTEN 
OUT5 OUT6 OUTD OUTB TION1 
1 2 5 6 7 9 
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PIN CONFIGURATION bade. aed PoE - 
PLASTIC LEADED 55 bb BOBO B. 66S 
CHIP CARRIER of P2295 BSSGcsb GbE ER o 
(TOP VIEW) SkREE Elo <<< SlE/f 386 8 
>aadadals SSfi2Zz caajos 
LOMO ANA AAA A 
vsso [_| - — r_] vss3 
ACCTYPEOUTO [__ [—_] MATCHIND 
ACCTYPEOUT1 [J [—] MATCHING 
_ ADDROUT7 [] MATCHINB 
ADDROUT6 [_ [—] MATCHINA 
~ DATAOUT3 [_| |] LATCHERRADR 
DATAOUT2 [_] || ADDRIN7 
DATAOUT1 [1 -—_] ADDRING 
DATAOUTO [__ -—] ADDRINS 
DATAOUTS [_] -—] ADDRIN4 
ADDROUTO [__ -—] ADDRIN3 
~ ADDROUT1 [_ |] ADDRIN2 
ADDROUT2 [_1 -—_] ADDRIN1 
ADDROUT3 [—_] ADDRINO 
ADDROUT4 [__] [—_] DATAINS 
ADDROUTS [_| -—] DATAIN7 
vss1 [=] vsse2 


SHHORHRE§ FZZ222222 9 — sods 
Oe A 9H F044 424424420 
ae ae mT sx tCkrEerErF ERE REE 
oowv £&F FA 00 ct cee eee 
00S 28 G reagaaaqaaaaada 
<<t#Oacnmn GG 
Zz uJ 
Wa z 
O Lu 
< 
ORDERING INFORMATION 
IDT XXXX XX X X 
Device Speed Package Process/ 
Type Temperature 
Range - 
anwar Blank Commercial 
‘G: 68-Pin PGA 
‘J 68-Pin PLCC 
'16' 16.67 MHz 
'20' 20.0 MHz 
'25' 25.0 MHz 
"33" 33.33 MHz 
'40' 40.0 MHz 
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DRAM CONTROLLER FOR PRELIMINARY 





THE R3051 FAMILY IDT79R3721 





Integrated Device Technology, Inc. 





FEATURES ¢ Supports all bus transfers of the R3051 family 
- Highly integrated DRAM Controller for R3051 Family — Page mode reads and writes 

Systems — Quad word reads 

— Direct control of DRAM data path transceivers — Normal read or write accesses 


— Direct handshake with R3041/51/81 84-pin PLCC 
— Direct contro! of DRAMs, including address, RAS, Supports page mode operation of DRAMs (either read or 
and CAS write) using on-chip page hit detector 
- Wide variety of DRAM subsystems supported Automatic CAS-before-RAS refresh with on-chip Refresh 
— 256K x 1 through 4MB x 4 DRAM devices timer | | | 
— 1 to 4 banks of DRAM Supports various system address decoding schemes 
— non-interleaved or two-way interleaved 
— DRAM access times of 100ns or faster 
— Capability to drive up to 36 DRAMs directly 


e 


e 





ALE 
Po conan 
a 
Data Path 


















Control — 
——$ —_____———— Dey 2) 
ll aa re Sean a 
a el Ea 
; Bank(3:0) 
Page Mode Byte Write | 
Comparator 
A/D(25:0) Mode 
Addr(3:2) Register 
Rd 
_Wr Timing 
RdGEn Family Paes 
Burst/WrNear EL ellie RAS/CAS RAS(3:0) 
SysClk Generator reawNaYic Waly 
MSel CAS(3:0) 
CS DRAM Access 
Control 
Refresh 
Controller Shae 
Figure 1. R3721 Block Diagram 
The IDT logo is a registered trademark and 
RtSControlier, RISChipset and R3051 are trademarks of integrated Device Technology, Inc. 
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INTRODUCTION 


The R3721 isaDynamic RAM Memory Controller, designed 


to offer the same levels of system flexibility as the R3051 
family. | 

The R3721 is responsible for translating between the 
R3051 family bus interface and the special control requirements 
of various DRAM based sub-systems. The R3721 performs all 
necessary handshaking and timing control. All that is required 
to implement a DRAM sub-system for the R3051 family is the 
R3721, DRAMs, an address decoder, and some transceivers 
forthe datapath. =» : 

The R3721 has been designed to enable systems to be 
implemented with field upgrade capabilities of their memory 
system. In order to upgrade to larger memory devices, or to 
increase the amount of memory, software merely needs to re- 
program the R3721 moderegister at boottime. No complicated 
re-routing of address lines, nor modifications of the data path 
need to occur. Thus, as with the R3051 family, a single 
footprint and base design can offer a wide variety of end 
products, depending on the frequency of devices selected, the 
amount of memory installed, and the specific R3051 family 
CPU selected. 

Figure 1 illustrates the block diagram of the R3721 DRAM 
controller. 


The R3721 DRAM controller contains all of the functional 


elements necessary to support the bus transaction 
requirements of the R3051 family. The R3721 connects 
directly to the R3051 bus, and captures address and control 
information from the bus as the R3051, or a DMA controller, 
drives it. The R3721 begins its transaction once its memory 
space is selected by an external address decoder. 

The R3721 will generate all of the DRAM control signal 
sequencing required: 

- Row address set-up to RAS asserted. 

¢ Row address hold from RAS asserted — 
¢ Column address set-up to CAS asserted 
- RAS to CAS delay 

- CAS to data valid (read) 

¢ WE to CAS set-up (write) 

In addition, the R3721 will manage the transceiver-based 
data path interface, to properly control the flow of data between 
the CPU bus and the DRAM devices. The R3721 can either 
control standard FCT245 type transceivers (non-interleaved 
memory systems), or use the high-performance 73720 Bus 
Exchanger (for interleaved memory systems). Figure 2 
illustrates a typical system composed of the R3051, R3721 
DRAM controller, and 73720 Bus Exchanger. 

Finally, the R3721 will provide the proper acknowledgement 
back to the R3051, at the optimum time. That is, the R3721 
will generate Ack and/or RdCEn, according to the timing 
model for the DRAMs and the type of transfer requested. 

This data sheet provides an overview on the R3721, and 
also includes specific electrical and mechanical information 
on the device. A detailed understanding of the R3721 and its 
uses canbe obtained fromthe A3721 Hardware User's Manual, 
available from your local IDT sales representative. 





| DRAM Address 


| Bank Sel(1:0 A25A(25:24) | ssid 


R3721 PROCESSOR INTERFACE 

The R3721 is designed to reside directly on the R3051 
family A/D and control busses. To complete the system 
design, an external address decoder is required, and external 
data path chips such as the IDT73720 Bus Exchangers, or 
IDT74FCT245 bi-directional transceivers. : 

Regardless of size or organization of DRAM, the R3721 is 
always connected to particular bits of the R3051 A/D bus. The 
R3721 uses programmed values for the DRAM size and 
organization to internally multiplex R38051 address lines into 
the appropriate row and column addresses for the DRAM. 
Table 1 shows the internal multiplexing of addresses performed 
by the R3721. Table 2 shows the DRAM bank selection, and 
which RAS/CAS control signals are output. 

The R3721 monitors the processor ALE, Rd, Wr and Burst/ 
WrNear control signals to determine the type of cycle in 
progress. The R3721 contains its own address latches, and 
alignes processor address outputs with DRAM Row and 
Column addresses. 

lf the external address decoder indicates that this transfer 
is intended for the DRAM sub-system, the R3721 performs the 
DRAM control interface. At the appropriate time, the DRAM 
controller will return the RdCEn/Ack handshake back to the 





‘processor to indicate that the transaction is sufficiently 


completed. 

The interface to Ack and RdCEn is performed using a tri- 
stateable output driver. This allows other tri-stateable sources 
to directly drive Ack and RdCEn without introducing 
combinatorial logic delays inherent in combining the 
acknowledgements from multiple memory subsystems. 





Interleaved 
A(11:3) 
| Row(8:0) | A012) 


Address assignment for 256k x 4 and 256k x 1 DRAMs 


Tcotumn(ec) | Ata) | AGi2)——*s 
P Bank Sel) | _Azsazsay [SSCS 


Address assignment for 1M x 1 and 1M x 4 DRAMs ° 


DRAM Address Non-interleaved | 
Column(10:0) A(13:3) A(12:2) 
| Row(10:0) A(24:14) |} A(23:13) 
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Address assignment for 4Mx1 and 4Mx4 DRAMs 
Table 1. Processor to DRAM Address Multiplexing 


|_Non-Interleaved _| 
|_RAS(0V/CAS(3:0) _| 
[RAS(IVCAS@.0) | RAS(:O/CAS(S:0)_| 
| _RAS(2)/CAS(3:0)_| RAS(3:2\/CAS(3:0) | 
RAS(3:2)/CAS(3:0) 


Table 2. Bank Selection in Multi-Bank System © 9051 tbi04 


9051 tbl 03 










Bank Sel(1:0 Interleaved 


RAS(1:0)/CAS(3.0) 


) 











01 
10 
14 








6.4 . 2 


IDT79R3721 DRAM Controller COMMERCIAL TEMPERATURE RANGE 








IDT R3051 Family 
RISControllier 






Address/ 
Data 


Control 








R3051 
Local Bus 










feta 


IDT 79R3721 
DRAM 
Controller 









Address Decode 





/O 





erbeaceeeececetetetocetetotebebetetscetetetoteshcesssesncececnoeceororhbaesnaesebenecenvanistecd#testeretetatosetatadatatesatetetatetctntetatettetetetetaterstitetststetet 


IDT 73720 
Bus Exchanger 





SSS Kes Ne SS 


ory 
setae! J ane, 
SS SPARED PPP TAPE na AERO TEPER ESET SSRIS ETO IE RETESTED OT TENS TEETER 


Figure 2. R3721 Used in R3051 System 
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R3721 DRAM INTERFACE 


The R3721 has been designed to interface to a wide variety 
of DRAM subsystems. Various options include: 
¢ Interleaved vs. Non-Interleaved 

Interleaved memory subsystems offer higher system 
performance by providing higher bandwidth to the processor 
during quad word refills. However, an interleaved memory 
system requires a larger “base” amount of memory (two 32- 
bit arrays minimum) and a wider data path (one for each 
array, time multiplexed onto a single CPU bus). 

The R3721 offers the system designer the flexibility to 
design either type of memory system. In fact, with proper 
planning, the system designer can offer a base model that 
does not perform memory interleaving, but allow field 
upgrades to perform interleaving (thus increasing both the 
memory and raw performance of the eyeleM): | 
Various densities of DRAM 

The R3721 allows the system designer to use DRAM 
densities from 256k x 1 through 4M x 4. Thus, depending 
on the memory requirements of the application, the system 
designer can decide the appropriate memory subsystem for 
the application. In addition, the DRAM controller internally 
aligns the CPU address bus with the DRAM address lines; 
this allows a later field upgrade to increase the density of 
memory devices used without requiring jumpering of address 
lines. The R3721 performs internal multiplexing of address 
lines in order to support varying densities of DRAMs, 
without changing its interface to the processor bus. 

Single bank or multiple banks of memory 

The R3721 allows systems to be constructed with one to 
four banks (32-bit wide memory arrays) of memory (either 
interleaved or not). Obviously, ithas been designed to allow 
various strategies of “field upgrades’ in the DRAM memory 
sub-system. 

The R3721 utilizes a high-performance output drivers, and 
four sets of the RAS and CAS DRAM controls, to directly 
drive up to 36 DRAM devices. The R3721 uses a high- 
power output driver with built-in series resistance to avoid 
the noise problems typically associated with driving large 
capacitive loads. 

In addition to the capability to directly drive these large 
loads, the R3721 also allows the system designer to 
incorporate additional, external memory drivers if needed. 
The various timing options supported can be selected to 
accomodate the additional delay of buffer drivers in the 
DRAM subsystem. 

The R3721 takes care of the particular case of partial 
writes. CAS(3:0) are used to provide selective enabling of 
those DRAMs being written; that is, only those byte lanes 


involved in the write will have their corresponding CAS 


signals asserted. 


- Intelligent Control interface to take advantage of Page 
Mode DRAMs 
The R3721 state machine was aeuigned after extensive 
simulation of R3051 program behavior. Optimizations around 
typical locality of reference are included in the state machine 
for the R3051. 


Figure.3 shows the basic state machine for the R3721. 
Note that it is optimized for series of page mode DRAM 
accesses. 

Specifically, page mode is used for: 
— Burst Refill. 

Page mode is used to obtain words within a quad word 
read. However, simulation has shown that the most likely 
next transfer is a single word write; thus, RAS and CAS are 
negated at the end of the burst refill to minimize the latency 
of subsequent operations due to RAS pie: charge. 

— Single Reads. 

After a single read, the DRAM controller will leave the 
DRAMs expecting a subsequent page mode access to the 
same page (either another read, a write, or a burst refill). 
The R3721 includes an on-chip page comparator which 
uses the DRAM density programmed into the device to 
determine whether or nota given access can take advantage 
of page mode. 3 

— Single Writes. 

After a single write, the DRAM controller will leave the 
DRAMs expecting a subsequent page mode access to the 
same page (either another write, a read, or a burst refill). 
The DRAM controller can use either WrNear, or its internal 
page comparator, to detect opportunity for page mode 
accesses. 

Thus, the R3721 has truly been optimized to the operating 
environment of the R3051 based systems. 
¢ Various speeds of DRAMs and Processors 

The R3721 has been designed to support a wide range of 
processor frequencies, across a wide range of DRAM 
speeds. The system designer can configure varying times 
forthe DRAM control signals. Programmable DRAM control 
parameters include: | 

— RAS to CAS Delay. 

This allows the system designer to control a number of 
critical timings, including row address hold time from RAS 
and the RAS to CAS delay requirements of the system. 

— RAS and CAS pulse widths. | 

These parameters directly control the access time of the 

DRAM, and the resulting system performance. 
— RAS and CAS pre-charge times. 

These parameters allow the system designer to minimize 
the performance penalty of DRAM pre-charge, yet still 
insure proper system operation. 
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— Refresh period. 

Depending onthe system speed, the DRAM controller will 
establish the appropriate counter value to insure both 
proper refresh operation, and to insure that the maximum 
RAS low time of the DRAM is not violated. The R3721 uses 
a CAS-before-RAS refresh protocol to perform DRAM 
refresh. 

— Address decode time. 

The DRAM controller can work in systems which can 
properly decode addresses within the first cycle of atransfer, 
for optimal performance. Alternately, the DRAM controller 
can work with slower systems, requiring an extra cycle to 
perform proper address decoding. 


Reset 


¢ Various data path options. 

The R3721 directly controls the data path between the 
CPU and the DRAM sub-system. The R3721 can control 
either aset of IDT74FCT245s (for non-interleaved memory 
systems) or IDT73720s (for either multiple banked or 
interleaved memory configurations). 

The R3721 allows this variety of options through the use of the 
on-chip MODE register. 
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Figure 3. R3721 DRAM Control State Machine 
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THE MODE REGISTER 

The mode register is a 16-bit write-only register used to 
configure the R3721 to adapt it-to a variety of different 
applications. Figure 4 illustrates the mode register. The settings 


of the mode register influence the signals used to control the 
external DRAM banks as well as the signals involved in 
controlling the data path. 


15 14 13 12 11 10 9  #£«8 7 6 of 4 3 2 1 0 
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Figure 4. R3721 Mode Register 


PROGRAMMING THE MODE REGISTER 

The mode register contains different fields that provide the 
R3721 with great flexibility in interfacing with a wide range of 
applications. Each field is used to control one aspect of the 
behavior of the R3721. All the fields get updated when writing 
to the mode register. 


DRAM PAGE SIZE FIELD 


Bits 0 and 1 of the mode register are used to inform the 
R3721 of the organization of the DRAMs used in the system 
as follows: 

This allows the R3721 to control up to a maximum of 64 
MBytes of memory in the “X1” configurations and up to a 
maximum of 16 Mbytes in the “X4” configurations. 


DRAM Page Size 
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EXTERNAL MEMORY CONFIGURATION 


Bit2 of the mode register are used to program the physical 
configuration of the external memory and the data path. 

The R3721 always assumes that Bus Exchangers are used 
in the data path for the interleaved configuration. In the Non- 
Interleaved configurations, it is possible to connect either 
standard transceivers or Bus Exchangers. 





Non-Interleaved memory system 


Interleaved memory system and Bus Exchangers 
are used in the data path. 


Memory Configuration 
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WRITE NEAR 


The R3721 has the ability to use the R3051 WrNear output 
to provide fast page mode writes. The extra delay may be 
appropriate in certain memory configurations, as discussed in 
User's Manual. 


SS ae Use of WrNear is enabled | 
Use of WrNear is disabled 
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RAS TO CAS DELAY 


Bit 4 of the mode register specifies the delay between the 
assertion of the appropriate RAS signal to the assertion of the 
related CAS signal. This delay can be programmed to be 
either one clock cycle or two clock cycles. Figure 5 illustrates 
the effect of the RCD bit . 

The DRAM controller always transitions the DAddr bus 
from Row Address to Column Address one-half clock cycle 
before the assertion of CAS. 


Bit4 
RCD RAS to CAS delay | 


| 0 ~~ | One clock cycle delay from RAS to CAS 














1 Two clock cycles delay from RAS to CAS 
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One clock cycle 








SYSCLK 
RASn ; 
CASn : | One Clock 
DAddr Row Addr aa : Column Addr : 
CASn 
DAddr | Row Addr : X : Cplumn Addr 
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Figure 5. RAS to CAS Delay 
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RAS TIMING 


Bits 5, 6 and 7 of the mode register specify the width of the 
RAS pulse in clock cycles as well as the RAS pre-charge time. 
This field gives the system designer the freedom to choose 
from a wide range of DRAM speeds based on a performance/ 
cost criteria. Figure 6 illustrates the timings of the RAS signals. 





One Clock 
FRSA fe Rh reg | FS PRE CHARGER? Clk 
os RAS PRE-CHARGE~3 Ck 
RASn 3 Clk 
9051 drw 06 


RAS PULSE = 4 Clk 
RAS PRE-CHARGE=4 Clk 


Figure 6. RAS Timing 
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One Clk 


SYSCLK 


CASn 


CASn 


CAS PULSE WIDTH 


Bit 8 of the mode register specify the CAS pulse width in 
clock cycles. The CAS pulse width can be programmed to be 
1.5 or 2.5 clock cycles. Figure 7 illustrates the timings of the 
CAS pulse width. 

The CAS pulse width, along with the CAS precharge time, 
has the most dramatic impact on system performance. These 
parameters affect the performance of the various page mode 
accesses performed by the DRAM controller, and thus directly 
affect the timing of the RdCEn and Ack acknowledgement 
signals back to the processor. 










CAS Pulse Width 
ee ee 2.5 clock cycles 
ee 1.5 clock cycles 
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Figure 7. CAS Pulse Width 
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CAS PRE-CHARGE TIME 


Bit 10 of the mode register specifies the CAS pre-charge 
time which could be programmed to be either 0.5 clock cycle 
or 1.5 clock cycles. Any combination between the CAS pulse 
width and the CAS pre-charge time is possible. Figure 8 
illustrates the CAS pre-charge timing. 


1 Clock 


SYSCLK | | | | | | 

i kept ck | -_— 
AAG, I | I ] 
CASn ! Lo I I 
| “a “ oe 1.5 Clk | 
CASn | ano 

| | 

] | 

I I 


! 
, | 9051 drw 08 | 







Figure 8. CAS Pre-charge Timing 
CAS Precharge . 


Bit 10 
CP Width 
15 clock eye 
15 clock evel 
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REFRESH PERIOD 

Bits 11, 12 and 13 of the mode register specify the frequency 
of the input clock to the R3721. The R3721 loads an internal 
refresh timer with the appropiate value to refresh the DRAMs 
according to the table below. 

The value is appropriate to avoid violating the maximum 
RAS low time of 10uS specification for DRAMs. Using this 


value, the DRAM controller will insure RAS does not stay low 
too long, by performing a refresh cycle. — ; 


Bit 11 Timer SysClk 
RFO Value Freq. 









To | 23 | 4MHe 
[1 see 
ee 
To [176 | 20MHe 
25 Miz 
a ee 
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DELAYED CHIP-SELECT 


Bit 14 of the mode register specifies when the R3721 will 
sample the Chip-Select and/or the Mode-Select input pins at 
the beginning of any access. The R3721 can be programmed 
to sample the Chip-Select on the first positive edge of the clock 
following the negation of ALE or on the first negative edge of 
the clock following the negation of ALE. 

This bit allows the R3721 to perform optimally in either a 
high-performance (or low frequency) system capable of rapidly 
decoding addresses, or in systems using a slower, or 
synchronous approach to address decoding. The R3721 
needs to also be explicitly aware of transfers which do not use 
its memory devices; for example, it can use these cycies to 


perform a DRAM refresh without performance loss in the 


system. | 

The DCS bit also affects the operation of the R3721 for 
page writes. If the DCS is cleared, the R3721 can perform 
page writes in a minimum of two clock cycles. If the DCS bit is 
set, the R3721 can perform page writes in a minimum of 3 
clock cycles. Figure 9 illustrates the timings of the Chip- 
Select or the Mode-Select input pins. 


CS sampled on the positive edge of the clock 2 clock | 
cycle page writes may be possible | 


1 + | CS sampled on the negative edge of the clock 2 clock 
cycle page writes not possible : 
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Figure 9. Delayed CS Settings 
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NOTE: 
1. Rsvd bits must be written with "0". 
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DEFAULT SETTINGS * 4 clock cycles for the RAS pulse width and the RAS pre- 


Atpowerup, the mode register isloaded withdefaultvalues Charge time 
which correspond to the following system: * 2.5 clock cycles for the CAS pulse width 
- DRAM page size: 512 entries ~* 1.5 clock cycle for the CAS pre-charge time 
* System configuration: Non-interleaved ° 25 Mhz frequency of operation 
- WrNear for fast writes enabled. * Delayed Chip-Select. 


* 2 clock cycles delay from RAS assertion to CAS assertion 
Figure 10 illustrates the settings of the mode register at 


power up. 





| PEE EE 


Di5 D114 D13 D12~~=—~D11 D10 D9 D8 D7 D6 D5 — D3 D2 D1 DO 
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Figure 10. MODE Register Default Values 


NOTE: | 
1. Rsvd bits must be written as "0". 


WRITING TO THE MODE REGISTER The R3051 can access the mode register by proceeding 
ne with a standard write operation to the I/O location occupied by 
the mode register. The R3721 detects the assertion of both the 
CS and the MSel lines and determines that the access is for 
the internal mode register. The data present on the R3051 


The mode register is a 16-bit write only register that controls 
the internal operation of the R3721 DRAM controller. The 
different fields of the mode register control the behavior of 


various output control signals such as the RAS and the CAS Me ane a é 
signals. At power up, the mode register is initialized with the data bus A/D(15:0) is written into the mode register, regardless 


default settings illustrated in figure 10. To obtain maximum °f System byte ordering. The R3721 returns the ACK signal to 
performance out of the R3721 DRAM Controller, the mode the R3051 to terminate the write access to the mode register 
register needs to be programmed tofit the applicationathand. i" 3 clock cycles. Thus, the write access to the mode register 

To access the internal mode register of the R3721, the |S always 3 clock cycles regardless of the configuration of the 


external address decoder mustassertboththe CSlineandthe ©xternal memory system. . 
MSel lines. The assertion of the CS line is imporatnt to Note that it is recommended that writes to the mode register 


distinguish among multiple R3721s in a single system. The USe ‘0’ in the upper A/D bits (A/D(25:16)). This insures 
Internal mode register of the R3721 shouldbe mapped inthe Compatibility with possible future versions of the DRAM 
uncachable |/O space of the R3051. controller. 
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PIN DESCRIPTION | | | | 


Reset: An active low input used to reset the DRAM controller state machines. At the end of Reset, 
the R3721 loads the mode register with default values, and performs 15 CAS-before-RAS refresh cycles 
to the DRAMs to initialize them. | 


Address/Data(25:0): These signals are connected directly with A/D(25:0) of the R3051 family CPU. 
The DRAM controller uses these inputs to obtain: . 3 


BE(3:0): Individual data byte enables used in write operations. 


Address(25:4): Address bits used to select amongst banks of DRAMs, and Row and Column addresses, 
according to tables 1 and 2. 


Data(15:0): During Mode register write operations, during the data phase the A/D bus carries the values to be | 
written into the mode register. 


Low Order Address(3:2):These signals carry the word within quad word address currently expected by the | 
processor. During single reads, or writes, these inputs carry the specified address. During quad word reads, the 
DRAM controller uses an internal counter to manage word within quad word addressing, and thus ignores these 
inputs. 
Address Latch Enable: This signal is used to de-multiplex the A/D bus from address to data phase. The R3721 
uses this signal to capture the current value of A/D(25:0) and Addr(3:2) during the address phase. The R3721 
also uses this signal as the indication of the beginning of a memory transfer, and awaits its “Chip Select”, 
according to the timing specified in the mode register. ; 


| Rd si Read: Indicates that the current transfer is a read (single or burst). 
Write: Indicates that the current transfer is a write (near or not). 


Burst: During reads, this signal functions as the “Burst” indicator. If burst is asserted during a read, 
the R3721 knows that a quad word read sequence is expected. 

WrNear: During writes, this signal functions as the “Write Near” indicator. Ifthe DRAM controller isin the “IDLE, 
RAS asserted” state, it may use this signal to retire the write in two cycles. 


System Clock: This is the master timing reference, and is a direct connection from the SysClk output of the 
| R3051 family processor. All timing events are referenced to the SysClk input. 


DRAM Chip Select: This input is provided by the external address decoder, and is used to indicate that this 
R3721 controls the DRAM responsible for retiring this transfer. The R3721 uses the programmed value in the 
Mode Register to determine when to sample this input. 


Mode Register Select: This input is provided by the external address decoder, and is used to indicate that this 
transfer targets the internal mode register of the R3721. To write to the mode register, both CS and MSel must 
be asserted by the external address decader. The R3721 uses the programmed value in the Mode Register to 
determine when to sample this input. 


Read Buffer Clock Enable: This output to the R3051 processor indicates that the currently requested word will 
be available on its A/D bus at the next sampling clock edge (falling edge of SysClk). 


This output is a tri-stateable output; it is only driven by the R3721 in transfers in which its CS input is asserted 
at the proper time. It is internally pulled up, so that no external pull-up resistor is required. : 


















A/D(25:0) 
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SIGNAL DESCRIPTION 


Acknowledge: This output to the R3051 family processor indicates that the R3721 has sufficiently 
processed the current transfer. 
On read operations, the processor uses this information to determine when to begin emptying the read buffer 
into the on-chip cache. The timing of this output during quad word reads is determined by the R3721 for optimal | 
performance. The R3721 will release the processor to begin execution as early as possible in the transfer, but | 
will insure that the fourth word of the quad read is available before the processor obtains it from the read buffer. | 
Thus, the processor can simultaneously execute the incoming instruction stream even while the R3721 obtains | 
| the remaining words of the transfer. 
On write operations, the processor uses this to terminate the write operation. 
This output is a tri-stateable output; it is only driven by the R3721 in transfers in which its CS input is asserted 
| at the proper time. It is internally pulled up, so that no external pull-up resistor is required. | 
DAddr(10:0) i DRAM Address: These outputs are typically connected directly to the DRAM multiplexed row/ column address 
RAS(3:0) ‘ 

| Cc : 










large capacitive loads. In addition, high-drive capability has been incorporated in these outputs. Thus, the 
R3721 can directly drive large numbers of DRAMs or multiple SIMM modules. 


Row Address Strobe: These outputs are directly connected with the RAS inputs of the DRAMs on a bank 
basis, according to table 2. The falling edge of this signal is used by the DRAM to capture the row address 
presented on DAdr(10:0). 


In order to directly. drive multiple DRAM devices, these signals provide high drive, and incorporate series 
resistors. Each RAS signal may drivemultiple loads with no system performance degradation. 


Column Address Strobe: These outputs are directly connected with the CAS inputs of the DRAMson a 
byte basis, according to table 2. The R3051 processor may write partial word quantities, in which case the R3721 
only enables those DRAMsin the byte lane being updated. CAS(3) corresponds to BE(3); CAS(2) corresponds 
to BE(2); etc. The falling edge of this signal is used by the DRAM to capture the column address presented on 
DAdadr(10:0). : 

In order to directly drive multiple DRAM devices, these signals provide high drive, and incorporate series 
resistors. However, the propagation delay of CAS is a system critical parameter: thus, no CAS signal should 
drive more than 8 loads. 


Bank Write Enable: These outputs are used to individually control the write enables of various memory banks. 
In non-interleaved systems, all four outputs are asserted; RAS selects the specific bank to be written. In 

interleaved systems, they are enabled in pairs; that is, writes to an even bank cause WBank(2) and WBank(0) 
to be asserted, while writes to an odd bank cause WBank(3) and WBank(1) to be asserted. Again, only the 
specific bank being written will have its RAS asserted, and thus only that bank will be updated during the write. 


During refresh cycles, these outputs are negated. This avoids accessing the “test mode” built into modern 4MB 
DRAMs. 

In order to directly drive multiple DRAM devices, these signals provide high drive, and incorporate series 
resistors 


DRAM Output Enable: This output is directly connected to the output enable of common I/O DRAMs. Itis 
connected to all DRAMs under the control of the R3721. 















inputs. Depending on the memory system organization and the organization of the DRAMs used, the R3721 
will align the processor addresses with the DRAM addresses according to table 1. 
AS(3:0) 


These outputs incorporate series resistors to eliminate overshoot and undershoot problems associated with 
WBank(3:0) ; 
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[SIGNAL | UO | 7 DESCRIPTION 


DByteEn(3:0) Data Path Byte Enable: These outputs are four identical output enables for the transceivers inthe DRAM data: 
path. Eveninthe case of partial writes, all four enables willbe asserted; CAS will control which devices actually | 
get updated. . “3 
In typical systems, DByteEn is connected on a byte lane basis to evenly distribute the load. For example, if | 
the data path interfaces uses 74FCT245s, then the DByteEn is directly connected to the “OE” input of the 
transceiver on that byte lane. If the data path uses IDT73720 Bus Exchangers, DByteEn(1:0) are connected 
to the Bus Exchanger on the lower half of the data bus (Data(15:0)), and DByteEn(2:0) are connected to the 
Bus Exchanger on the upper half of the data bus (Data(31:16)). : 

Transmit/Receive: This signal indicates the direction of the data path, and is connected directly to the T/R | | 
_ input of the 74FCT245 or IDT73720. This output is HIGH during write cycles, and LOW during reads. | 
_ Path: This signal.is directly connected to the Path input of the IDT73720. It is used to specify the even or 

odd memory bank participating in the current transfer. A HIGH specifies an even bank, and a LOW specifies 

an odd bank. . 

Data Path Latch Enable: This signal is connected to the YLEn and the ZLEn inputs of the IDT73720 Bus 

Exchanger. Itis used to capture the data provided by both banks of memory of an interleaved system, for later 

sequencing onto the processor A/D bus. 
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ABSOLUTE MAXIMUM RATINGS" *) RECOMMENDED OPERATING 
[ symbol[ Rating | Commercial _[ unit | TEMPERATURE AND SUPPLY VOLTAGE 


VTERM_ | Terminal Voltage with —0.5 to +7.0 V | Grade | ‘Temperature | GND | 
Respect to GND 


Commercial 0°C to +70°C | 
Tc, TA | Operating Temperature 0 to +70 °C (Ambient) 
(Ambient) 7 
Temperature Under Bias (—55 to +125 . 


| Tsta | Storage Temperature —55 to +125 
} Vin Input Voltage | -0.5 to +7.0 


NOTES: 9051 thi 17. ° 

1. Stresses greater than those listed under ABSOLUTE MAXIMUM RATINGS | 
may cause permanent damage to the device. This is a stress rating only OUTPUT LOADING FOR AC TESTING 
and functional operation of the device at these or any other conditions 
above those indicated in the operational sections of this specification is not 
implied. Exposure to absolute maximum rating conditions for extended 
periods may affect reliability. 

2. VIN minimum = —3.0V for pulse width less than 15ns. 
Vin should not exceed Vcc +0.5V. 

3. Notmore than one outputshould be shorted ata time. Duration of the short 
should not exceed 30 seconds. 













Vcc 


5.0 +5% 
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To Device . 
Under Test 


AC TEST CONDITIONS 


Parameter 


Input HIGH Voltage 


Input LOW Voltage | 
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9051 tb! 18 CLD is 25pF unless otherwise specified in the AC Electrical 
Characteristics Table. 
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DC ELECTRICAL CHARACTERISTICS— (Ta = 0°C to +70°C, Vcc = +5.0V +5%) 


Symbol Test Conditions 









VOH 


NOTES: 9051 tbl 20 
1. Vit Min. = —3.0V for pulse width less than 15ns. Vit should not fall below —0.5V for larger periods. 

2. Vik should not be held above Vcc + 0.5V. 

3. Guaranteed by design. 

4. Except for Ack and RdCEn. 

5. For DAddr: loH = —64mA, lot = +64mA, for all other signals: loH = —-32mA, IoL = +32mA. 
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AC ELECTRICAL CHARACTERISTICS “1 2:5) (Ta = 0°C to +70°C, Voc = +5.0V +5%) 


Symbol Description . | Test Condition 


SysClk rising to DByteEn negated 50 pF - 
Pus SysCik. falling to OE asserted 300 pF 
300 pF_ 
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CS & MSel hold after SysClk 50 pF 
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NOTES: i 9051 tbl 22 

1. All timings referenced to 1.5V unless specified « otherwise. 

2. The AC values listed here reference timing diagrams contained in the A3721 aiware User's Manual and this data sheet. 

3. Guaranteed by design. 

4. This parameter is used to derate the AC timings according to the loading of the system. This parameter provides a deration for loads over the specified 
test condition; that is, the deration factor is applied for each 25pF over the specified test load condition, unless specified otherwise. 

5. Test Conditions refer to the capacitive load and VoH and Volt level of outputs. Refer to the AC test conditions diagram for more information. 
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PIN CONFIGURATION 
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Figure 11. Single Read in Non-interileaved Memory System 
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Figure 12. Page Mode Read in Non-Interleaved Memory System 
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Figure 13. Single Write in Non-Interleaved Memory System 
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Figure 14. Page Mode Write in Non-Interleaved Memory System 
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Figure 16. Single Read in interleaved Memory 
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Figure 17. Page Mode Read of Interleaved Memory 
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Figure 18. Page Mode Write to Interleaved Memory 
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Figure 19. Quad Word Read from Interleaved Memory 
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Figure 20. Power On Reset 
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Figure 21. Warm Start Reset 
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Figure 22. CAS-before-RAS Reset Sequence 
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Figure 23. CAS-before-RAS DRAM Refresh 
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Figure 24, Input Clocking 
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84 LEAD PLCC (SQUARE) 
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C1 | Le SEATING PLANE 


NOTES: 9051 drw 33 


All dimensions are in inches, unless otherwise noted. 

BSC—Basic lead Spacing between Centers. 

D & E do not include mold flash or protutions. 

Formed leads shall be planar with respect to one another and within .004” at the seating plane. 
ND & NE represent the number of leads in the D & E directions respectively. 

D1 & E1 should be measured from the bottom of the package. 
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Commercial Temperature Range 
84-Pin PLCC 


20.0 MHz 
25.0 MHz 


DRAM Controller 
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Integrated Device Technology, Inc. 


16-BIT TRI-PORT 


IDT73720/A 


BUS EXCHANGER 








FEATURES: 


High-speed 16-bit bus exchange for interous communica- 
tion in the following environments: 

— Multi-way interleaving memory 

— Multiplexed address and data busses 

Direct interface to R3051 family RISChipSet™ 

— R3051™ family of integrated RISController™ CPUs 

— R3721 DRAM controller 

Data path for read and write operations 

Low noise 12mA TTL level outputs 

Bidirectional 3-bus architecture: X, Y, Z 

— One CPU bus: X 

— Two (interleaved or banked) memory busses: Y & Z 
— Each bus can be independently latched 

Byte control on all three busses 

Source terminated outputs for low noise and undershoot 
control 

68-pin PLCC and 80-pin PQFP package 
High-performance CMOS technology. 


FUNCTIONAL BLOCK DIAGRAM 


e 





NOTE: 


1. Logic equations for bus control: 


OEXU = T/R* . OEU*; OEXL = T/R*. OEL*; OEYU = T/R. PATH. OEU* 


DESCRIPTION: 


The IDT73720/A Bus Exchanger is a high-speed 16-bit bus 
exchange device intended for inter-bus communication in 
interleaved memory systems and high-performance multi- 
plexed address and data busses. 

The Bus Exchanger is responsible for interfacing between 
the CPU A/D bus (CPU address/data bus) and multiple 
memory data busses. 

The 73720/A uses a three bus architecture (X, Y, Z), with 
control signals suitable for simple transfer between the CPU 
bus (X) and either memory bus (Y or Z). The Bus Exchanger 
features independent read and write latches for each memory 
bus, thus supporting a variety of memory strategies. All three 
ports support byte enable to independently enable upper and 
lower bytes. 


Y0:7 


Ya:15 
(Even Path) 


PATH 
TR 
OEU 


Z0:7 


28:15 
(Odd Path) 


2527 drw 014 


Figure 1. 73720 Biock Diagram 


OEYL =T/R.PATH. OEL*; OEZU = T/R . PATH* . OEU*; OEZL = T/R.PATH*. OEL* 


The IDT logo is a registered trademark and RiSChipSet. RISController. R3051 are trademarks of integrated Device Technology, Inc. 
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PIN CONFIGURATIONS 


Xg C11 , 59 [11 78 
Xg C412 ~_ Designator 7 58 [ Z7 
x10 G78 : : | HI Z6 
X11 4414 56 O Z5 
X12 O15 -} 24 
x13 O16 P23 
x14 1” | ae Dr) 22 
X15 C418 ry 21 
GND "9. | Fy ZO 
VCC | - LI GND 
PATH 21 . fr] VCC 
OEU G2 | LEXZ 
LEYX C23 ie 3 1 OEL 
LEZX igs 2 | rT LEXY 
YO (25 1 T/R 





Designator. : 





PQFP 
TOP VIEW 
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PIN DESCRIPTION 


X(0:15 /O 






P zo) [v0 



















the direction specified by T/R . 


the direction specified by T/R . 


ABSOLUTE MAXIMUM RATINGS") 


-0.5 to +7.0 | -0.5 to +7.0 
TA Operating 0 to +70 —55 to +125} °C | 
| | Temperature 


| Under Bias | | | 
| Temperature | | 






Terminal Voltage 
with Respect 




















PT Power 1.0 1.0 W 
| Dissipation | | | 
Current 





NOTE: 2527 tbl 03 

1. Stresses greater than those listed under ABSOLUTE MAXIMUM 
RATINGS may cause permanent damage to the device. This is a stress 
rating only and functional operation of the device at these or any other 
conditions above those indicated in the operational sections of this 
specification is not implied. Exposure to absolute maximum rating 
conditions for extended periods may affect reliability. 


Bidirectional Data Port X. Usually connected to the CPU's A/D (Address/Data) bus. 
| (0:15) /O Bidirectional Data port Y. Connected to the even path or even bank of memory. | 
Bidirectional Data port Z. Connected to the odd path or odd bank of memory. 


Latch Enable input for Y-Write Latch. The Y-Write Latch is open when LEXY is HIGH. Data from the X-port 
(CPU) is latched on the HIGH-to-LOW transition of LEXY 


Latch Enable input for Z-Write Latch. The Z-Write Latch is open when LEXZ is HIGH. Data from the X-port 
(CPU) is latched on the HIGH-to-LOW transition of LEXZ. 


Latch Enable input for the Y-Read Latch. The Y-Read Latch is open when LEYX is HIGH. Data from the even 
path Y is latched on the HIGH-to-LOW transition of LEYX. 


Latch Enable input for the Z-Read Latch. The Z-Read Latch is open when LEZX is HIGH. Data from the odd 
path Z is latched on the HIGH-to-LOW transition of LEZX 


Even/Odd Path Selection. When HIGH, PATH enables data transfer between the X-Port and the Y-port (even 
path). When low, PATH enables data transfer between the X-Port and the Z-Port (odd path). 


Transmit/Receive Data. When HIGH, Port X is an input Port and either Port Y or Z is an output Port. When low, 
Port X is an output Port while Ports Y & Z are input Ports 


Output Enable for Upper byte. When LOW, the Upper byte of data is transfered to the port specified by PATH in 


Output Enable for Lower byte. When LOW, the Lower byte of data is transfered to the port specified by PATH in 
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CAPACITANCE (Ta = +25°C, F = 1.0MHz) 
Parameter!) | Conditions | Max.| Unit| 
Input Capacitance Vin=OV | 8 | pF 


VouT = OV 





Output Capacitance 


NOTE: 


2527 tbl 04 


1. This parameter is guaranteed by device characterization, but is not prod- 
uction tested. 


SEL | Functionality 
Tt [25x (i6-bis)-Read Z| 









XZ (16 bits)-WriteZ | 
_ | YX (16-bits)-Read YO) | 
XY (16 bits)-Write Y 


H All output buffers are 
| disabled . 
X L Transfer of lower 8 bits 


(0:7) as per PATH & T/R 
X X L H Transfer of upper 8 bits 
(8:15) as per PATH & T/R 


NOTES: 2527 tbl 01 
1. For ZX and X-Z transfers, Y-port output buffers are tristated. 
2. For YX and XY transfers, Z-port output buffers are tristated. 


















IDT73720/A 16-BIT TRI-PORT BUS EXCHANGER 


ARCHITECTURE OVERVIEW 


The Bus Exchanger is used to service both read and write 
operations between the CPU and the dual memory busses. It 
includes independent data path elements for reads from and 
writes to each of the memory banks (Y and Z). 
control is managed by a simple set of control signals, analo- 
gous to a simple transceiver. In short, the Bus Exchanger 
allows bidirectional communication between ports X and Y 
and ports X and Z as illustrated in figure 1. 

The data path elements for each port include: 

Read Latch: Each of the memory ports Y and Z contains a 


transparent latch to capture the contents of the memory bus. 


Each latch features an independent latch enable. — 

Write Latch: Each memory port Y and Z contains an indepen- 
dent latch to capture data from the CPU bus during writes. 
Each memory port write latch features an independent latch 
enable, allowing write data to be directed to a specific memory 
port without disrupting the other memory port. 


Data Flow Control Signals 

T/R (Transmit/Receive). This signal controls the direction 
of data transfer. A transmit is used for CPU writes, and a 
receive is used for read operations. 

OEU, OEL are the output enable control signals to select 
upper or lower bytes of all three ports. 

Path: The path control signal is used to select between the 
even memory path Y and the odd memory path Z during read 
or write operations. Path selects the memory port to be 
connected to the CPU bus (X-port), and is independent of the 
latch enable signals. Thus, it is possible to transfer data from 
one memory port to the CPU bus (X) while capturing data from 
the other memory port. | | 





MEMORY READ OPERATIONS 


Latch Mode 


In this mode the read operation consists of two stages. 
During the first stage, the data present at the memory port is 
captured by the read latch for that memory port. During a 
subsequent stage, data is brought from a selected memory 
port to the CPU A/D port X by using output enable control. 

The read operation is selected by driving T/Rlow. The read 
is managed using the Path input to select the memory port (Y 
or Z); the LEYX/LEZX enable the data capture into the 
corresponding Read Latch. 

In this way, memory interleaving can be performed. While 
data from one bank is output onto the CPU bus, data on the 
other bank is captured in the other memory port. In the next 
cycle, the Path input is changed, enabling the next data 


Data flow. 


COMMERCIAL TEMPERATURE RANGE 


element onto the CPU bus, while the first bank is, presented 
with a new data element. | 


Transparent Mode 


The Bus Exchanger may be eae as a data transceiver by — 


‘ leaving all latches open or lenspalent: 


Memory Write Operations 


Memory write operations also consist of two distinct stages. | 


- During one stage, the write data is captured into the selected 


6.5 


memory port write latch. During a later stage, the memory iS 
presented on the memory port bus | 

The write operation is selected by driving T/R high. Writes 
are thus performed using the Path input to select the memory 
port (Y or Z). The LEXY/LEXZ capture data in the correspond- 
ing Write Latch. 

Note that it is possible to utilize the bus exchanger’s write 
resources as an additional write buffer, if desired; the CPU 
A/D bus can be freed up once the data has been captured by 
the Bus Exchanger. 


APPLICATIONS 


Use as Part of the R3051 Family ChipSet 

Figure 2 shows the use of the Bus Exchanger in a typical 
R3051 based system. 

In write transactions, the R3051 drives data on the CPU 
bus. The latch enables are held open through the entire write; 
thus, the bus exchanger is used like a transceiver. The 
appropriate LEXY/LEXZ signal is derived from ALE (Logic 
low- indicating that the processor is driving data) and the low 
order address bit. The rising edge of Wr from the CPU, ends 
the write operation. 

During read transactions, the memory system is respon- 
sible for generating the input control signals to cause data to 
be captured at the memory ports. The memory controller is 
also responsible for acknowledging back to the CPU that the 
data is available, and causing the appropriate path to be 
selected. 

The R3721 DRAM controller for the R3051 family uses the 
transparent latches of the read ports. The R3721 directly 
controls the inputs of the bus exchanger, during both reads 
and writes. Consult the R3721 data sheet for more informa- 
tion on these control signals. 


Use in a general 32-bit System 

Figures 3 and 4 illustrate the use of the Bus Exchanger i in 
a 32-bit microprocessor based system. Note the reduced pin 
count achieved with the Bus Exchanger. 
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Figure 2. Bus Exchanger Used in R3051 Family System 
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DRAM 1 
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Figure 3. CPU System with Transparent Data Path Figure 4. CPU System with Latched Data Path 
(2-way Interleaving) (2-way Interleaving) 
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DC ELECTRICAL CHARACTERISTICS (Vcc = 5.0V + 5%, TA = 0°C to +70°C) 


__Test Conditions™ | 
Input HIGH Level 


a 
ae 
Input HIGH Current Vcc = Max., VIH = 2.7V }Inputsonly = | — | — | 50 | uA 
Inia Secale | acmomeeen case oc an 
A te oe 
i ee 
= 

















= 2. 
Input LOW Current VCC = Max., ViL = 0.5V Inputs only 
| /O. pins 






VIH 
VIL - 
HH 
HL 
Clamp Diode Voltage | Vcc = Min., lin=—18mA 
Short Circuit Current Vcc = Max., Vo = GND | —60.0 
OH 
OL 
' 
Icc | 





Output HIGH Voltage —_| Vcc = Min., VIN = ViHor Vit, lo¥ = —-12mA a ee ee 
Output LOW Voltage Vcc = Min., VIN = ViIH or VIL, lOL = 12mA _ 


| 
Input Hysteresis Vcc = 5V 200.0 mV 
All inputs | 
Quiescent Power Vcc = Max. 0.2 1:5 mA 
Supply Current | VIN= GND or Vcc | | | | 
Alcc Quiescent Power Vcc = Max. 0.5 2.0 mA/ 
Supply Current VIN = 3.4 V4) Input 
| mA/ 
| | MHz 




















Dynamic Power Vcc = Max. 
| Supply Current®) VIN = Voc or GND 
Outputs Disabled 
OE =Vcc. 

One Input Toggling 
50 % Duty Cycle 


Voc = Max. 
VIN = Vcc or GND | 
| Outputs Disabled 
50 % Duty Cycle 
OE = Vcc 

fi = 10MHz 

One Bit Toggling 

















Total Power Supply 
Current ® 









NOTES: 2527 tbl 05 
For conditions shown as Max. or Min., use appropriate Vcc value. | 

Typical values are at Vcc = 5.0V, +25°C ambient. 

Not more than one output should be shorted at a time. Duration of the short circuit test should not exceed one second. 
Per TTL driven input (VIN = 3.4V); all other inputs at Vcc or GND. | 

This parameter is not directly testable, but is derived for use in Total Power Supply Calculations. 

Ic = IQUIESCENT + IINPUTS + IDYNAMIC 

Ic = Icc + Alcc DHNT + Iccp (fcP/2 + fiNi) 

Icc = Quiescent Current 

Alcc = Power Supply Current for a TTL HIGH Input (VIN = 3.4V) 

DH = Duty Cycle for TTL Inputs HIGH 

Nt = Number of TTL Inputs at DH 

Iccb = Dynamic Current Caused by an Input Transition Pair (HLH or LHL) 

fcp = Clock Frequency for Register Devices (Zero for Non-Register Devices) 

fi = Input Frequency 

Ni= Number of Inputs at fi 

All currents are in milliamps and all frequencies are in megaherz. 


OAnon 


AC TEST CONDITIONS 


Input Pulse Levels GND to 3.0V 
Input Rise/Fall Times 5ns 
Input Timing Reference Levels 1.5V 


Output Reference Levels a 1.5V 
Output Load | See Figure 5 
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IDT73720 /A16-BIT TRI-PORT BUS EXCHANGER COMMERCIAL TEMPERATURE RANGE 


AC ELECTRICAL CHARACTERISTICS (Vcc = 5.0V + 5%, TA = 0° to +70°C) 


| 73720A 73720 
ymbol Parameter [Test Conditions | Min.@ | Min. |) Max. | 


tPLH X to Y & X to Z Latches enabled Ci = 50pF 

tPHL RL = 500 Ohms 
tPLH Y to X & Z to X Latches enabled 

tPHL 

tPLH Latch Enable to Y & Z Port LEXY to Y 

tPHL LEXZ to Z 













7) 

















tPLH_ | Latch Enable to X LEYX to X 

tPHL LEZX to X 

tPLH Path to X Port Propagation Delay 

tPHL 

tHZ Y & Z Port Disable Time (T/R, PATH, OEU, OEL)® 

tLz 

1ZH Y & Z Port Enable Time (T/R, PATH, OEU, OEL) 

tZL 

tHZ X-Port DisableTime (T/R, OEU, OEL)°) 

tLZ | 
ZH X-Port Enable Time (T/R, OEU, OEL)®) 

{ZL 
aa 
H ee ee 


O 





Port to LE Set-up time | 
Port to LE Hold time | 


NOTES: 2527 tbl 07 
1. All timings are referenced to 1.5 V. 

2. This parameter is guaranteed by design, but not tested. 

3. Bus turnaround times are guaranteed by design, but not tested. (T/R enable/disable times). 








TEST CIRCUITS AND WAVEFORMS SWITCH POSITION 
Veo No-e7ov [Test Switch 









Disable Low Closed 








500Q Enable Low . 
Al Other Tes a 
| _ Pulse DEFINITIONS: 2527 tbl 08 






| Generator CL = Load capacitance: includes jig and probe capacitance. 


Rt = Termination resistance: should be equal to ZouT of the Pulse 
5000 Generator. 
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Figure 5. Test Circuit for all outputs 
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IDT73720/A 16-BIT TRI-PORT BUS EXCHANGER 


SET-UP, HOLD AND RELEASE TIMES 


_3V 
DATA 
=. 5V 
{SU OV 


TIMING 






| INPUT T15V 
ASYNCHRONOUS CONTROL 
PRESET —3V 
CLEAR 15V 


| ETC. 
SYNCHRONOUS CONTROL 






_ COMMERCIAL TEMPERATURE RANGE 
PULSE WIDTH 


LOW-HIGH-LOW 


PULSE 1.5V 


HIGH-LOW-HIGH 
PULSE 


1.5V 





2527 drw 09 

















tSU tH OV 
2527 drw 08 
PROPAGATION DELAY ENABLE AND DISABLE TIMES 
= 3V ENABLE DISABLE 
SAME PHASE 1.5V 3V 
INPUT TRANSITION OV 
| CONTROL 1.5V 
VoH INPUT OV 
OUTPUT = SY 
VOL OUTPUT 3.5V 3.5V 
| NORMALLY SWITCH N. 4 sy 
OPPOSITE PHASE 1.5V a ia OL 
INPUT TRANSITION 7 tPZH 
VOH 
| OUTPUT swiTcH 0.3V 
2527 drw 10 NORMALLY OPEN 
HIGH oy ov 
2527 drw 11 
NOTES: ; 
1. Diagram shown for input Control Enable-LOW and input Control Disabie- 
_ HIGH. 
2. Pulse Generator for All Pulses: Rate < 1.0 MHz; ZO < 50Q; tF < 2.5ns; tr 
. , < 2.5ns. 
ORDERING INFORMATION 
IDT 73720 X —  X X 
Device Speed Package Process/ 
Type Temperature 
Range 
Blank Commercial Temperature Range 
J 68-Pin PLCC 
PQ 80-Pin PQFP 
Blank Standard Speed 
A High Speed 
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RASTER IMAGE PROCESSOR ADVANCE 
Integrated SystemController™ INFORMATION 





for IDT R3051 Family IDT79R3730 





| Integrated Device Technology, Inc. 





FEATURES: * Programmable Interrupt Controller 
+ Integrated SystemController™ for IDT R3051 Family * High-performance programmable video interface 
Raster Image Processors and Laser Printer Controllers 1, 2 or 4 serial video streams support high video 
— Direct interface to R3041, R3051, R3052 and R3081 bandwidth (bi-level or 4-bit color) 
— Supports clock frequencies to 40MHz — Video DMA channels with chaining 
¢ High-performance, programmable DRAM controller: — On-chip 8-word x 32-bit video FIFO 
— Flexible DRAM control for up to 128MB of two-way — Programmable margin counters 
interleaved or non-interleaved DRAM (up to 8 banks) Video bit rates to 160 megabits per second 
— Wide variety of memory configurations (e.g. different Supports split stream video 
size DRAMs for base and SIMM extensions) Pel counter 


— DMA channel for memory pattern fill/clear Controls for ROM memory system . 
— DRAM parity generation/checking — Programmable controls for up to 24MB of interleaved 


* Programmable I/O ports support glue-less interface or non-interleaved ROM/EPROM (up to 6 banks) 
support low-cost peripherals =  BUlst ROM/ enn Supper 
— Burst DMA channels with chaining — Glue-less interface for 8-bit boot ROM or EEROM 


— On-chip 4-word x 32-bit FIFOs with data packing and =~ General PETROS functions 
— Bus timeout counter 


unpacking ) inter | 
— Master/slave peripheral interface ~ Rotate assist (0°, 90°, 180°, 270° and mirror) hard- 
— 8-bit and 16-bit I/O ports ware 


— General purpose counter/timer 


* Coprocessor DMA interface for accelerator ASICs 
— Bit programmable !/O port 


(e.g. Adobe's Type 1 font rasterizer and 
PixelBurst™ display list coprocessor) 
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Programmable 
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Figure 1. IDT79R3730 System Organization 2906 drw 01 


The IDT logo is a registered trademark and RISController, R3041, R3051, R3052, R3081, and SystemController are trademarks of Integrated Device Technology, Inc. 
Adobe, the Adobe logo, PostScript, PixelBurst, and the PostScript logo are trademarks of Adobe Systems Incorporated which may be registered in certain jurisdictions. All others are trademarks of their 
respective companies. 
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Figure 2. IDT79R3730 Block micuen 


GENERAL DESCRIPTION 


The IDT79R3730 SystemController™ is a highly inte- 
grated and highly programmable controller for high-perfor- 
mance Raster Image Systems. 


matching between the high-speed CPU/memory bus and 
_ slower standard I/O peripherals. The IDT79R3730 provides 
- Standard interfaces for the R3051 family of RISControllers™ , 
a high-performance, flexible memory system controller, an 
optional coprocessor accelerator interface, controls for indus- 


try standard VO peripherals, and a nigh- Bown nenee engine 


interface. 


Int(2:0 2 





Figure 3, CPU to IDT79R3730 Interface 
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The IDT79R3730 provides unique system flexibility by 
providing comprehensive programmability of the system in- 
terface functions andtiming parameters. These include controls 
for memory configuration and device size, edge timing for 
DRAM and I/O peripheral control signals, and controls for the 
format and timing of engine interface signals. Each system 
interface is defined based on specific system goals for that 
interface. Standard IDT79R3730 feature selections assume 
that most printer OEMs want a low base unit price, with 
modular add-ons for memory, !/O and accelerators. 


CPU INTERFACE 


IDT79R3730 communicates with the CPU via a multi- 
plexed address/data bus and control interface that is a pin-for- 
pin, glue-less match with the control signal and timing stan- 
dards for IDT's R3051 RlSController os neaMng R3041, 
R3051, R38052, and R3081. 

A key system goal of |DT79R3730 is to integrate most 
general purpose logic to lower system cost, while preserving 
OEM flexibility for product differentiation through CPU and 
clock frequency choices, memory timing/tuning and addition 
of coprocessor accelerators. 

- A primary performance goal is to run the CPU bus at 
maximum utilization and frequency, with a minimum of stalls 
caused by memory system latencies or I/O device bandwidth 
limitations. IDT79R3730's on-chip FIFOs, arbitration logic 
and DMA capability allow I/O transfers to burst in and out of 
system memory atmaximum memory system bandwidth. The 
CPU bus remains at minimum loading with only the CPU, 
IDT79R3730, an address latch, memory system buffers/ 
transceivers and the system coprocessor (if used). 
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Figure 4. IDT79R3730 Two-Way Interleaved Memory System 


MEMORY SYSTEM INTERFACE 

The IDT79R3730 provides the controls for a wide range of 
DRAM and/or ROM based memory system configurations. 
Options range from DRAM-only systems where the inter- 
preter and fonts are down-loaded from the host and run out of 
DRAM, to ROM based systems including interpreter and 
fonts, or systems including SCSI disc for font caching. 

Controls are provided to support base-plus-extension 
memory organizations where the minimum DRAM configura- 
tion is installed in the board and optional memory is added by 
the end user (typically in SIMMs). The base and extension 
options can be configured with different size devices. There 
can be 1, 2 or 4 base memory banks, and 1 to 6 extension 
banks, up to a total of 8 banks of DRAM. Configuration 
Register controls are provided to allow the user to specify 
different device sizes in each area, and how much memory is 
addressable. The memory system can be all two-way inter- 
leaved or all non-interleaved. ee ed 

The IDT79R3730 provides the RAS, CAS, chip select and 
enable controls for a wide variety of configurations. Since the 
target applications for IDT79R3730 typically have large 
amounts of memory, 1DT79R3730 provides enable controls 
for external latches/transceivers. I!DT79R3730 supports 
page mode accesses, early write cycles, and CAS before RAS 
refresh. 

The IDT79R3730 Configuration Registers provide control 
for each of the various timing edges to allow precise memory 
system tuning to get the best performance at each cost point. 


This makes it possible to independently scale CPU perfor- 
mance over a wide frequency range to achieve the desired 
cost/performance balance. Timing resolutions down to one 
half the period of the CPU clock are selectable for memory 
system conirol timing via the Configuration Registers. 


BOOT MEMORY CONTROL 


The IDT79R3730 always starts up from a boot ROM. 
Therefore, IDT79R3730 provides a single 8-bit boot ROM 
control interface in conjunction with 8-bit data Port A. One of 
the IDT79R3730 Configuration Registers provides timing 
control for this interface. 4 MB of address space is supported. 
Timing from chip select to first data sampling and to next 
access Is programmable to accommodate device timing. 


PROGRAMMABLE ENGINE INTERFACE 


The video data is transferred via DMA from system memory 
to the 8-deep by 32-bit wide video FIFO, and is then serialized 
for the video output(s). DMA chaining is supported for video 
transfers, and can be set for single line transfers, multiple line 
transfers, or full page transfers. The IDT79R3730 supports 
duplex printing. 

The video interface provides one, two or four serial data 
streams, plus synchronization signals. Four video data streams 
provide lower clock rates for high-bandwidth bi-levei, or 4-bit 
parallel output for grey scale or 4 or 8-bit continuous tone 
color. 
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Figure 5. Boot ROM Connections 


The IDT79R3730 engine interface includes VCLK, a video 
clock input from an external pixel clock oscillator, and SHCLK, 
a programmable mode shift clock output indicating when 
video data is valid. Inputs are also provided for LSync (line 
sync, or beam detect), an input from the engine indicating 
when to start a new scan line, and PSync (page sync), an 
input indicating the start of a new page. 

Vertical and horizontal page margins are supported by 
programmable "skip counters" that allow specification of the 
number of vertical lines or horizontal dots to be skipped before 
starting to print. 

A "pel" counter is provided to count the number of black 
dots printed. This provides a measure of toner usage. 

The IDT79R3730 supports split stream video for LED page 
printers. | 








VData0d 



















Engine Interface 
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Figure 6. Engine Interface 


GLUE-LESS I/O CHANNELS 

The IDT79R3730 supports the end-product strategy that 
the minimally configured product must be low-cost with modu- 
lar addition of a wide range of point-to-point or network I/O 
options. The options must also be low-cost and as glue-less 
as possible, even though IDT79R3730 targets high-perfor- 
mance products. The |DT79R3730 provides the "intelligence" 
needed to obtain maximum bandwidth from low-cost standard 
/O peripherals by incorporating DMA, FIFOs and packing/ 


_ unpacking logic. Therefore, the I/O option strategy can take | 
_ advantage of the lowest cost device options (e.g. 53C90 for — 


SCSI, 85C30 for Appletalk, 82593 for Ethernet), rather than 
peripherals with higher cost because of their wide data buses 
or on-chip CPUs and DMA controllers. 

Two independent I/O ports are provided. Port A provides 


channels AO and A1 to support one or two 8-bit devices, and 


provides the data path for Boot ROM. Port B provides 
channels BO, B1 and B2 to support up to three 8 and/or 16-bit 
devices. 

Channels AO and BO are associated with 16MB of address 
space, and are therefore, appropriate locations for font ROM 
(cartridge or board) addition. Ports A1, B1 and B2 are 
associated with 16KB of address space, and are appropriate 
choices for I/O peripherals. _—. 

Individual channels within I/O Ports A and B share Rd, Wr 
and data lines, but have individual chip select, and DMA 
request/acknowledge lines. Wait, Burst and the timing rela- 
tionships between Rd/Wr and CS/DACK are programmable 
for each channel via the Configuration Registers. This allows 
customization of I/O channel control timing for each specific 
/O device. 
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Figure 7. /O Port Options 
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Figure 8. I/O Port Logic | DMA BURST I/O 


Data bus A and B each have a four word deep, 32-bit wide, High speed I/O transfers, DRAM frame buffer Pattern Fill 


bi-directional FIFO, and the logic to pack and unpack bytes (Clear), and video output data transfers are supported by 
(and 16-bit half words for Port B) to 32-bit words (Figure 8). eleven independent DMA channels. IDT79R3730 breaks all 
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Enable Control 
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Figure 10. IDT79R3730 DMA Architecture 
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data transfer requests into burst transfers of 1—16 bytes. 
Channels A1, B2 and Pattern Fill have single Master channel 
register pairs. Channels AO, BO, B1 and video also have 
associated chaining register pairs. The data in the chaining 
register pair is automatically transferred to the master register 
pair when the master count reaches zero. The chaining 
register pair allows a "next" packet to immediately begin 
transfer without incurring the delay required to reprogram the 
master channel, an important feature in high-performance 
DMA implementations (e.g. saves memory for I/O by allowing 
use of smaller buffers). | 

Each register pair has an associated "valid" bit that indi- 
cates when the data in the count register is valid. An active- 
high Master valid bit indicates thatthe data associated with the 
Master address/count transfer has not completed transfer. An 
active-high Chain valid bit indicates that the data in the chain 
address/count register has not yet been transferred to the 
Master register pair. 

Controls are provided to set a variety of priority scenarios. 
Channels can be assigned to different priority groups, be 
disabled or have round-robin priority. There are four indepen- 
dent DMA state machines: Port A, Port B, Video and DRAM. 


COPROCESSOR SUPPORT 


The IDT79R3730 provides a generic 32-bit multiplexed 
interface for various coprocessor configurations using a com- 
bination of CPU interface signals Rd, Wr and ALE, plus 
coprocessor interface signals DRQ, DACK and DS. The 
coprocessor resides on the CPU's A/D bus. This provides a 
straighforward interface (one PAL plus data buffers) to sup- 
port standard devices such as Adobe's Type 1 font rasterizer 
or PixelBurst display list coprocessor. 
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INTEGRATED INTERRUPT CONTROLLER 


The IDT79R3730 contains an interrupt controller with mul- 
tiple interrupt Sources and programmable grouping into high 
or low-priority CPU. interrupt inputs. There is a Configuration 
Register for masking individual interrupt sources, and an 
interrupt cause register. 


TIMER/COUNTERS 


A programmable 9-bit Timeout Counter and a general 
purpose programmable 27-bit Timer/Counter are included 
and incorporated into the interrupt structure. 

The Timeout Counter can be used as a CPU timeout 
counter (counting from the start of a CPU active bus cycle to 
the assertion of Ack), or as a DMA timeout counter (acounting 
from the assertion to the de-assertion of BusGnt). On a CPU 
timeout, BErr is asserted for one cycle and an interupt is 
asserted. For a DMA timeout, DMA acknowledge (Ack) and 
an interrupt willbe asserted, and BusReq will be de-asserted. 

The general purpose Timer/Counter can be programmed 
as either function. As a counter, counting will stop with the 
assertion of TCnt when terminal count is reached. As a timer, 
TCnt willbe asserted and the programmed count value will be 
reloaded when terminal count is reached. 








ROTATE ASSIST 


The Rotate Assist logic performs 0°, 90°, 180° or 270° 
rotations and mirror rotations on a 16x16 bit array. 


CONFIGURATION REGISTERS 


The IDT79R3730 is configured at boot time by program- 
ming various fields in 49 internal Configuration Registers. The 
IDT79R3730 occupies 128KB of address space. The location 
of this space (the Base Address) is programmable via the 
Base Adaress register. This register's address is 1 FFXXXXX. 

The other registers are at the offset relative to the base 
address as shown in Table 1. 
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| offset Register Name/Function 
! 0000 General Purpose Registers 


R3730 base address register 
CPU interface 
CS and DRAM base address 
CS and External I/O base addresses 













Channel priority 
Pattern fill data 
[i000 | BRAN Control Configuration Registers 
1000 Size and RAS timing 
1004 CAS and refresh timing 
| = 2000 /O Bus A Configuration Registers 
| 2000 Channel AO and A‘1 interface timing 
2004 Boot CS timing 
| 3000 /O Bus B Configuration Registers 
| 3000 Channel BO and B1 interface timing 
3004 Channel B2 interface timing 
4000 DMA Configuration Registers 
| Channel AO DMA address 
Channel A1 DMA address 
Channel BO DMA address 
Channel B1 DMA address 
Channel B2 DMA address 
Channel AO DMA chain address 
Channel BO DMA chain address 
Channel B1 DMA chain address 
Channel AO DMA count 
Channel A1 DMA count 
Channel BO DMA count 






































Table 1. IDT79R3730 Register Address Offset _ 
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Register Name/Function 


Channel B1 DMA count 
Channel B2 DMA count 
Channel AO DMA chain count 
Channel BO DMA chain count 
Channel B1 DMA chain count 
Pattern fill address 
Pattern fill count 
[5000 [Video Configuration Registers | 
Video interface 
Pel counter 
Margin widths 
Margin widths chain register 
Video buffer DMA address 
Video buffer DMA chain address 
Addressing mode and DMA count 
Addressing mode and DMA chain count 
Test mode left 
Test mode right 


6000 Rotate Assist Configuration Register 
7000 System Configuration Registers | 


Parallel 1/0 mode register 

Parallel I/O 

Bus time out count 

General purpose timer/counter mode/count 


‘Interrupt cause register 


Interrupt mask register 
Interrupt priority register 
CSO, CS1 and CS2 timing and address space 
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PIN DESCRIPTION | 
PINNAME VO. DESCRIPTION 
CPU Interface _ 
| A/D(31:0) /O  Address/Data: These signals are connected directly with A/D(31 :0) of the CPU. 


This isa 32-bit time multiplexed bus which indicates the desired address for a bus transaction in one ase 
and which is used to transmit data between the CPU and system memory resources during the rest of the 
transfer. 


Transactions on this bus are logically separated into two phases: during the first address phase, | 
information about the transfer is presented to the memory system to be captured using the ALE output. | 
This information consists of: 


Address(31 :4): The high-order address for the transfer is presented on A/D(31:4). 


BE(3:0): These strobes indicate which bytes of the 32-bit bus will be involved in 
the transfer, and are presented on A/D(3:0). 


. The use of the bus during the second data phase depends on the type of transfer. Duting CPU write ayciss 
the bus contains the data to be stored and is driven from the CPU's internal write buffer. On read cycles, 
the bus receives the data from the external resource, in either a single data transaction or in a burst of four 
words, and places it into the on-chip read buffer. 


Addr(3:2) Low Address (3:2) A 2-bit bus which indicates which word is currently expected by the processor. 


Specifically, this two bit bus presents either the address bits for the single word to be transferred (writes 
or single datum reads) or functions as a two bit counter starting at ‘00’ for burst read operations. 


Address Latch Enable: Used by the CPU to indicate that the A/D bus contains valid address information 
for the bus transaction. This signal is used by IDT79R3730 logic to capture the address for the transfer. 


Burst Transfer: On read transactions, the Burst signal indicates that the current bus read is requesting 
a block of four contiguous words from memory. This signal is used ane in read cycles. This pin connects 
to the CPU's Burst/WrNear | 


Read: A CPU output which indicates that the current bus transaction is a read (single or burst). 
Write: A CPU output which indicates that the current bus transaction is a write. 


O Acknowledge: An output to the CPU which indicates that the memory system has sufficiently 
processed the bus transaction, and that the CPU may either terminate the write cycle or process the read 
data from this read transfer. 


O Read Buffer Clock Enable: A IDT79R3730 output which indicates to the CPU that the memory system 
has placed valid data on the A/D bus, and that the processor may move the data into its on-chip Read 








| Buffer. 
SysClk System Reference Clock: An input to |IDT79R3730 from the CPU which reflects the timing of the processor 
bus interface. This clock is used as a timing reference for the system interface. 
BusReq O DMA Arbiter Bus Request: An output to the CPU which requests that the CPU tri-state its bus interface 
| signals so that they may be driven by an external master. 
| BusGnt DMA Arbiter Bus Grant. An input to |IDT79R3730 from the CPU used to acknowledge that a BusReq has 
been detected, and that the bus is relinquished to the external master. 
BErr O Bus Error: Output to the CPU bus interface unit to terminate a bus transaction due to an external bus 
error. 


O Processor Interrupt: Interrupt outputs to the CPU from IDT79R3730, providing prioritized interrupt during 
normal operation and system configuration input during reset in conjunction with the Config(2:0) inputs. 


DataEn Data Enable: This input from the CPU indicates that the A/D bus Is no longer being driven by the CPU 
during read cycles, thereby allowing the IDT79R3730 to enable the memory system drivers onto the 
A/D bus without a bus conflict. 
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PINNAME I/O DESCRIPTION | 
FST avr denn I EEUU, 


CS(2:0) O General Purpose System Bus Chip Select: Three Chip Select outputs associated with the 32-bit CPU 


high speed bus. The memory space associated with each chip select is programmable from 512KB to 4MB | 
in non-interleaved mode, and 1MB to 8MB in interleaved mode. 





IntA3 O Interleave A3: This is an address output which is substituted for the latched address A3 from the A/D data 
bus when the ROM/EPROM memory is interleaved. 


OEM/AD(QOdd) Buffer Control Output Enable: Output Enable for the data path buffer from the memory system to the A/ 
D bus for the odd bank in interleaved memory systems. 


O 
OEMAD(Even) O Buffer Control Output Enable: Output Enable for the data path buffer from the memory system to the A/ 
D bus for the even bank in interleaved memory systems. 





OEADM O Buffer Control Output Enable: Output Enable for the data path buffer from the A/D bus to the memory 
system data bus. | 

LEMAD O Buffer Control Latch Enable: Latch Enable for the data path from the memory system data bus to the A/D 
bus. 

LEADM(Odd) O Buffer Control Latch Enable: Latch Enable for the data path from the A/D bus to the memory system for 


the odd bank in interleaved memory systems. 


LEADM(Even) O Buffer Control Latch Enable: Latch Enable for the data path from the A/D bus to the memory system for 
the even bank in interleaved memory systems. 


DRAM Interface 


DAddr(10:0) O DRAM Address: These outputs are typically connected to the DRAM multiplexed row/column address 
inputs. The relationship between CPU address and DRAM address is a function of DRAM size and 
organization. 


Row Address Strobe: These outputs are directly connected to the RAS inputs of the DRAMs on a bank 
basis. The falling edge of RASis used by the DRAMs to capture the row address presented on DAddr(10:0). 


| CASOdd(3:0) O Column Address Strobe: These outputs are directly connected to the CAS inputs of the DRAMs on a 
byte basis. The falling edge of CAS is used by the DRAMs to capture the row address presented on 
DAddr(10:0). These signals are identical to CASEven in the non-interleaved mode. 


CASEven(3:0) O Column Address Strobe: These outputs are directly connected to the CAS inputs of the DRAMs on a 
byte basis. The falling edge of CAS is used by the DRAMs to capture the row address presented on 
DAddr(10:0). These signals are identical to CASOdd in the non-interleaved mode. 









Write Enable: This output is used by the DRAM memory system to determine if a memory access is a read 
or write cycle (W = LOW for a write cycle, W= HIGH forareadcycle). | | 





| Video Interface 
| VCLK Video Clock Input: An input clock from the print device running at the video dot rate 


| 
SHCLK O Shift Clock Output: The programmable mode Shift Clock output from IDT79R3730 to the printer device 
indicating when Video Data output is valid. 






VData(3:0) O Video Data Outputs (serial): Video Data output to the printer device, typically formatted through one bit 
stream from VData0, two data streams from VData0 and VDatat, or four streams from all four outputs (to 
limit video frequency). 


/O Line Sync Input/Output: Depending on the configuration of the printer/output device, LSync can be used 
as On output from the IDT79R3730 to the device to indicate that valid data is present. It can also be used 
as an input to IDT79R3730 to begin transfer of the next line of video. 





PSync | Page Sync Input: This is an input to IDT79R3730 from the output device to begin transfer of the next page. | 
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IDT79R3730 RIP Integrated SystemController™ 


PIN NAME 
VO PortA 


| DataA(7:0) 


| BootAddr(1:0) 


BootCS 
| VO Port B 
DataB(15:0) 


DRQB(2:0) 





DESCRIPTION 


Port A Data Bus: Bidirectional 8-bit data bus for Port A, for channels AO, A1, and Boot ROM during startup. | 


Port A Read: An IDT79R3730 output which mgleates: that the current port A nus transaction is a read HOM, | 
the pelibhetauenanne’ selected. 


Port A Write: An |IDT79R3730 output which indicates that the current port A bus transaction is a write to’ 
the peripheral/channel selected. 


Port A Chip Selects: IDT79R3730 Chip Select outputs for channels A1 and AO. an | 


Port A DMA Request: IDT79R3730 inputs for channels A1 and AO requesting DMA service for the 
respective peripheral/channel. 


Port A DMA Acknowledge: IDT79R3730 outputs for chiannels Ai and AO indicating that DMA service 


requested by a respective peripheral/channel is acknowledged and that bus access is granted. 


Port A Wait Request: An input from a Port A device that indicates that a transfer cycle needs to be extended. 
An active Wait input will halt the state machine control for port A. 


Port A Done: An open collector pin that goes low to indicate the end of a DMA cycle, or can be driven low 
to terminate the current DMA cycle for an A channel. 


Boot Address (Port A): These outputs supply the byte address in conjunction with the word address 
latched from the A/D bus for the a byte wide boot ROM/EPROM using the A port data bus. 


Port A Boot ROM Chip Select: Chip Select for the Boot ROM device during startup. 


Port B Data Bus: Bidirectional 8/16-bit data bus for Port B channels BO, B1 and B2. 


Port B Read: A IDT79R3730 output which indicates that the current port B bus transaction is a read from 
the peripheral/channel selected. 


Port B Write: A IDT79R3730 output which indicates that the current port B bus transaction is a write Hoi 


‘the peripheral/channel selected. 


Port B Chip Selects: IDT79R3730 Chip Select outputs for channels BO, B1 and B2. 


Port B DMA Request: IDT79R3730 inputs for channels BO, BI and B2 requesting DMA service for the 
respective peripheral/channel. 


Port B DMA Acknowledge: !DT79R3730 outputs for channels BO, B1 and B2 indicating that DMA service | 
requested by a respective peripheral/channel is acknowledged and that bus | access is granted. 


Port B Wait Request: An input from a Port B device that indicates that a transfer cycle needs to be extended. 
An active Wait input will halt the state machine control for port B. 


Port B: An open collector pin that goes iow to indicate the end of a DMA cycle, or can be driven low to 
terminate the current DMA cycle for a B channel. 


_ Port B Byte Enables: The Byte Enables indicate which byte is valid for 16-bit peripherals. 


Port B Half-Word Select: This output provides address A1 for 16-bit devices and indicates how 16- bit data 


~ Input to Port B is interpreted on a half-word basis. 
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PIN NAME 


VO 


System Interface 


Config(0) 


| PIO(11:0) 


/O 


IDT79R3730 RIP Integrated SystemController™ 
DESCRIPTION 


Configuration Input 0: Config(0) is an IDT79R3730 input wired high or low to set the Big/Little endian 
format during Reset for the A/D bus within IDT79R3730. During Reset, Config(0) also controls the state 
of IntO, which is connected to CPU Int0 to set the endian format of the CPU. After completion of the reset 
cycle, IntO resumes its normal function as an asynchronous, maskable, interrupt input to the CPU. 


Configuration Input 1: Config(1) is an IDT79R3730 input wired high or low to set the state of output Int 
during Reset, and therefore,the state of the CPU input Int1 is connected to. After completion of the reset 
cycle, Intl resumes its normal function as an asynchronous maskable interrupt input to the CPU. 


Configuration Input 2: Config(2) is an IDT79R3730 input used in conjunction with the Test input. During 
IDT79R3730 Reset, if Test is set low, Config(2) is a IDT79R3730 input wired to set the state of Int1. If Test | 
is set high during IDT79R3730 Reset, a Config(2) high input will cause |DT79R3730 to be reset with all | 
IDT79R3730 outputs three-stated for board testing. If Test is set high during IDT79R3730 Reset, a 
Config(2) low input will cause IDT79R3730 to be reset in a mode used only for factory test. 


Coprocessor DMA Request: An IDT79R3730 input to request DMA service for the coprocessor 
interface, using the A/D data bus. 


Coprocessor DMA Acknowledge: An|DT79R3730 output indicating that DMA serviceforthe coprocessor | 
interface is acknowledged and that bus access is granted. 


Data Strobe: This input indicates to the IDT79R3730 when data on the A/D bus Is valid. 


Programmable I/O: The PIO pins provide three general purpose functions: programmable bit-level |/O, 
maskable interrupt input, or parity generation/checking between the IDT79R3730 and DRAM. Each pin 
can be programmed to be an input or an output. Their function is set by programming individual bits/fields 
in the appropriate Confuguration Registers. PlO(11:0) are sub-divided into three 4-bit fields. Two 
Configuration Register bits determine if fields PIO(3:0) and PIO(7:4) are to function as I/O bits or as parity 
fields for DRAM. If they are to be usedfor parity, PIO(3:0) provides parity bits forthe even bank (interleaved) 
and PiO(7:4) provides parity for the odd bank. | 


If PIO bits are used as I/O, additional Confuguration Register bits determine if individual PIO bits are to 
be used for input or output. In the input mode, a low level input will be interpreted as a maskable interrupt, 
readable from the cause register. 


Reset IDT79R3730: This input resets the IDT79R3730. 
Terminal Count: This output indicates that the timer/counter has decremented to zero. 


Test Mode Input: Used in conjunction with Config(2) to three-state IDT79R3730 outputs for board test. 
See Config(2) description. 
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Integrated Device Technology, Inc. 


256KB/ 1MB/ 4MB 


IDT79R4000 SECONDARY CACHE IDT7MP6074 | 
“MODULE BLOCK FAMILY IDT7MP6084 








PRELIMINARY 






IDT7MP6094 








FEATURES: 


High-speed BICMOS/CMOS secondary cache module block 
constructed to support the IDT79R4000 CPU 

Available as a pin compatible family to build 256KB (uni- 
fied), 1MB (unified) and 4MB (unified or split) secondary 
caches 

Zero wait-state operation 

Four-word line size a2 
Operating frequencies to support 50MHz and 75MHz 
IDT79R4000 

Available as a set of four identical high-density 80-lead 
(gold-plated fingers) SIMMs (Single In-Line Memory Mod- 
ules) 


e 


e 


laminate (FR-4) substrate 

Multiple ground pins and decoupling capacitors for maxi- 
mum noise immunity 

TTL compatible I/Os 

Single 5V (10%) power supply 


Surface mounted plastic components on a multilayer epoxy | 


DESCRIPTION: 


The IDT7MP6074 is a 256KB IDT79R4000 secondary 
cache module block constructed on a multilayer epoxy lami- 
nate substrate (FR-4), using eleven 16K x 4 SRAMs and two 
IDT74FBT2827 drivers. The IDT7MP6084 is a 1MB 
IDT79R4000 secondary cache module block using eleven 


~ 64Kx4SRAMs, andthe IDT7MP6094 isa 4MB IDT79R4000 


secondary cache module block using eleven 256K x 4 static 


RAMs. The IDT74FBT2827 has internal 25W series resistors 


and BiCMOS 1/Os resulting in the fastest propagation times 
with minimal overshoot and ringing. Four identical cache 
module blocks comprise a full secondary cache. 

The IDT7MP6074/84/94 support use in an IDT79R4000- 
based system at speeds of 50MHz and 75MHz with zero wait- 
state operation. These modules support a four word line size. 
For other line sizes, please consult factory. 

All inputs and outputs of the IDT7MP6074/84/94 are TTL- 
compatible and operate from a single 5V supply. Fully asyn- 


~ chronous circuitry is used, requiring no clocks or refresh for 


operation. 





FUNCTIONAL BLOCK DIAGRAM 


oe — 
aa 74FBT2827 

: ee 
TCS 
BCS 


Ao 
WE aos 









256K x 8 
TAG 





256K X 36 
DATA 





36 
1/O0-35 To-7 
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PIN CONFIGURATION PIN NAMES 






Ao-17 Address Inputs 
DCs | DataChip Select 
|TS | TagChipSelect_ 


2833 tbl 02 







CAPACITANCE 


Symbol| Parameter) | Conditions_| Max. | Uni 
Input Capacitance (Data)|_vin=0V__| 10_| 


CIN(A) Input Capacitance VIN = OV 10 | pF 
(Ai-15, OE, TCS, DCS) | 

CIN(B) Input Capacitance VIN = OV 100 | pF 
(Ao, WE) | 


Output Capacitance VouT = OV 


NOTE: 2833 tbl 03 
1. This parameter is guaranteed by design, but not tested. 










RECOMMENDED DC OPERATING 
CONDITIONS 








NOTE: 2833 tbl 04 
1. VIL =-—1.5V for pulse width less than 10ns. 


SIMM ABSOLUTE MAXIMUM RATINGS 


TOP VIEW 


Symbol | Unit | 
VTERM Terminal Voltage with Respect |-0.5to+7.0] V 
to GND 


iste | Storage Temperature ___|-85t0 +128 | C 


RECOMMENDED OPERATING NOTE: eae 
TEMPERATURE AND SUPPLY VOLTAGE 1. Stresses greater than those listed under ABSOLUTE MAXIMUM _RAT- 


Ambient INGS may cause permanent damage to the device. This is astress rating 
| | Temperature | Vcc 


only and functional operation of the device at these or any other conditions 


above those indicated in the operational sections of this specification is 
2833 tbl 01 














not implied. Exposure to absolute maximum rating conditions for ex- 
tended periods may affect reliability. 
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DC ELECTRICAL CHARACTERISTICS | 
(Vcc = 5V + 10%, TA = 0°C to +70°C) | _ 
Parameter [| _Test Conditions 
| Input Leakage (except Ao, WE) 


Input Leakage (Ao, WE) Vcc = Max., VIN = GND to Vcc 


Output Leakage | | Vcc = Max., CS = VIH, Vout = GND to Vcc | | 
Operating Current CS = VIL; Vcc = Max., Outputs Open ee a 2200 | ma | 


Output HIGH Voltage | Vcc = Min., loH =—4mA 








Output LOW Voltage . Vcc = Min., lo. = 8mA 
| 2833 tb! 06 
AC TEST CONDITIONS — 
| Input Pulse Levels GND to 3.0V | 
Input Rise/Fall Times 5ns | 
Input Timing Reference Levels 1.5V 
Output Reference Levels | 1.5V | 4800 
Output Load | See Figures 1 - 4 | 
2833 tbl 07 
DATA out 
+5V 
| 2550 5pF* 
4800 | 
| 2833 drw 04 
DATA out 
. Figure 2. Output Load 
2550, 30pF (for toz and toxz) 
* Including scope and jig. 


Oo WO fF A DN BW 


2833 drw 03 
Figure 1. Output Load DTAA 
(Typical, 
ns) | 
DATAouT 
1 
500. 


20 40 60 80 100 120 140160 180 200 
1.5V CAPACITANCE (pF) 


2833 drw 05 





2833 drw 06 


Figure 3. Alternate Output Load Figure 4. Alternate Lumped Capacitive Load, 
Typical Derating 





6.7 3 


IDT7MP6074/84/94 (256K B/1MB/4MB) | 
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AC ELECTRICAL CHARACTERISTICS 
Vcc = 5V+10%, TA = 0°C to +70°C 

7MP6074/6084/6094SxxM 

Symbol | Parameter | Max. | | 

READ CYCLE 


Address Access Time 


Min. | 
= 
[taoa___| AoAccessTime | — | 
| Output Enable to Output Valid | — | 

= 

a 





a 
on 
w 1S) 
Oo jo) 
3 |D _) =) 


tAA 
tOE 


torz! 2 
WRITE CYCLE 

taw__| Address Valid to End-of-write | 12 | —_ 
wow | AoValidtoEnd-of-write | 10 | — | 
we | WritePulsewiath | 7 | — | 
tDw -of-Wri = 


toHz"") 
1) 


A 
Nh 
t 
ae) 
i 
. 
i 
ine) 
f 
a 





Ntio 


Data Valid to End-of-Write 
tDH Data Hold Time ns 


NOTE: : 
. This parameter is guaranteed by design but not tested. 


P10 


—_ 


ADDR ADDR VALID | 


tAA 
oE 6 
DATAouT . ii DATA VALID 


NOTE: 


. This parameter is guaranteed by design, but not tested. 2833 drw 07 


— 


TIMING WAVEFORM OF WRITE CYCLE 


ADDR ADDR VALID 


$$ TA 
twP 


tDw {DH 


DATAIN "4 DATA VALID | 


$) 


2833 drw 08 
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PACKAGE DIMENSIONS 


FRONT VIEW 


4.640 
4,660 











0.940 
0.960 
0.240 
0.260 PIN1 TYP TYP 0.390 SIDE VIEW 
0.410 
BACK VIEW ‘ 
PIN 1 
2833 drw 09 
ORDERING INFORMATION 
IDT XXXX X XXX X X 
Device Power Speed Package Process/ 
Type Temperature 
Range 
fae Blank Commercial (0°C to +70°C) 
M FR-4 SIMM (Single In-line Memory Module) 
12 \ 
15 
us Speed in Nanoseconds 
25 | 
30 
S Standard Power 


7MP6074 256KB IDT79R4000 Secondary Cache 
odule Block 
7MP6084 1MB IDT79R4000 Secondary Cache 
Module Block 
7MP6094 4MB IDT79R4000 Secondary Cache 2833 drw 10 
Module Block 
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PRODUCTS 





RISC DEVELOPMENT 
SUPPORT PRODUCTS 


Integrated Device Technology, Inc. 





INTRODUCTION 


For engineers developing software and hardware products 
around the IDT79R3000 Instruction Set Architecture (ISA), 
which includes the IDT79R3000, IDT79R3001, and 
IDT79R3051 family of RISControllers, IDT offers three software 
products, several prototyping and evaluation systems, and 
two versions of its MacStation bundled with IDT’s software 
development tools. IDT is also an authorized reseller of MIPS 
workstations and software and has a close working relationship 
with many third parties who also provide support tools for this 
processor family. This catalog primarily focuses on products 
manufactured and sold directly by Integrated Device 
Technology. 


SOFTWARE PRODUCTS 


IDT/c—IDT’s optimizing ANSI C-compiler. This compiler, 
which uses the Gnu C front end, includes full ANSI C com- 
patibility and highly efficient floating point emulation libraries 
for IDT79R3051-based systems (without hardware floating 
point). A unique debug control scripting language makes it 
easy to locate hardware problems that occur only under rare 
conditions. IDT/c includes the compiler, optimizer, assembler, 
linker, librarian, C libraries, Floating Point Libraries, and 
symbolic debugger. 

IDT/sim—IDT/sim is IDT’s System Integration Manager, 
used to bring up new hardware and to support the symbolic 
debug in both the MIPS and IDT C compilers. IDT/sim is a 
ROMable debug kernal with extensive diagnostics built-in. It 
is supplied in EPROM on all IDT prototyping boards, and is 
available in Source code for use with either the MIPS or IDT C 
Compilers. 

IDT/kit—!DT/kit is our newest software development prod- 
uct: The Kernel integration ToolKit. It contains source code 
and compiled versions of a complete set of routines for 
initializing systems, servicing interrupts, handling floating 
point exceptions, and so forth. Also included is source code 
for ANSI libraries, for the Floating Point Emulation Libraries 
and for transcendental functions. 


PROTOTYPING SYSTEMS 


Completely assembled and tested hardware systems are 
available for prototyping and initial software porting. Allinclude 
a CPU, serial /O, EPROM containing the IDT/sim monitor, 
and some amount of RAM. All have provision for simple 
addition of user-defined hardware. Units are available with 
the 3051/2 CPU, with the IDT79R3000 and IDT79R3010, and 
with high-performance CPU modules containing the 
IDT79R3000/R3010 and up to 256 KB each of I- and D- cache. 

For laser printer controllers, the IDT79S389 Reference 
Platform provides a ready prototyping target for R3051 Family 
laser printer controllers using PostScript™ Level 2 software 
from Adobe. 


MacStation DEVELOPER SYSTEMS 


The MacStation Developer Systems include an 
IDT79R3000-based CPU card, a Macintosh || computer, a 19” 
monochrome monitor, the Apple extended keyboard, the 
complete Unix software package (MIPS RISC/os with NFS 
and X), and an external hard disk large enough for the Unix file 
systems. The Developer Systems include all three of the 
software tools listed above. IDT is an authorized Apple VAR 
for these systems. 


CUSTOM DEVELOPMENT PROGRAMS 


IDT candesign and manufacture your R3000 based product 
for you. We manufacture very high performance CPU mod- 
ules for a number of IDT79R3000 users, including Pyramid 
Technology andAT&T. These modules are designed according 
to your needs and are manufactured on 8 to 10 layer FR-4 
boards using surface mount components on both sides of the 
board for the tightest possible layouts. We have built thousands 
of these modules at speeds of 33MHz and higher. 

We have also designed and manufactured IDT79R3000 
and IDT79R3051 board level products for lower performance, 
more cost-sensitive applications such as laser printers. 

In addition to our extensive design and layout experience 
with IDT79R3000 based products, IDT has a great deal of 
experience in indentifying problems as new software is brought 
up on in new hardware. This skill helps us bring up new 
systems very quickly. In production test, we use software 
routines that have been developed over several years of 
volume manufacturing experience to ensure that products 
operate reliably under worst-case conditions. 


THIRD PARTY DEVELOPMENT TOOLS 


The increasing popularity of IDT’s RISController family has 
resulted in a dramatic increase in the number of third party 
tools available. For information on these products, contact 
your local IDT sales representative. 
¢ Real-Time Operating Systems from Lynx, Ready Sys- 
tems, and Wind River 
Compilers from MIPS, Green Hills, and BSO Tasking 
VME Boards from CES, RISQ Modular Systems, 
Omnibyte, and Sanders Associates 
Device Simulation Models from LMSI, Zycad and HDL 
Systems 
Peripheral Support Circuits from V3 and National Semi- 
conductor 
Page Description Language interpreters from Peerless 
and Adobe Systems 
In-Circuit Emulators from Embedded Performance, Inc. 
¢ Logic Analyzer support from Hewlett-Packard, Fluke 

Instruments, and Tektronix 


e 
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THIRD-PARTY DEVELOPMENT TOOLS AND 
APPLICATIONS SOFTWARE FOR IDT RISC 





PROCESSORS 





Integrated Device Technology, Inc. 





OVERVIEW 


The MIPS/IDT RISC Microprocessor family is supported by 
a wide variety of third-party development tools and applica- 
tions software. Many of these tools are software products, 
useful across the entire line of processors; others of these are 
hardware development tools, appropriate for one or two 
members fo the family. 


As the MIPS architecture is increasingly popular and suc- 
cessful, many newtools are constantly being announced. IDT 
encourages our customers to work closely with their local 
sales representative for a current list of third party support. 
This listing is intended to be reasonably current as of the data 
of this document. 


Type Vendor Product Name Phone 
Logic Analyzer 
Arium ML4400 714) 978-9531 
800) 862-7486 
Biomation 


408) 988-6800 


310) 926-6727 


(714) 
(800) 
CLAS 4000 (800) 538-9320 
(408) 
Hewlett Packard (310) 


Support thru Corelis 


For HP16500 / 1650 
Fluke PM3580 (800) 44-FLUKE 
Tektronix DAS9200 (800) 426-2200 
(503) 627-7111 
In-Circuit Emulator 
Embedded Performance (408) 980-8833 
Simulation Models 
Mentor Graphics HML (800) 547-7390 
VIEWlogic (508) 480-0881 
HDL Systems Verilog (408) 432-3209 
Protocol Model Bank (201) 347-7900 


Ada Development on Vax host 


DDC | (212) 696-3700 
Telesoft (619) 457-2700 
Alsys (617) 270-0030 
Verdix (800) 289-8237 
VME Board 
RISQ Modular Systems (415) 490-0732 
Lockheed Sanders, Inc. (603) 885-6022 
Omnibyte (708) 231-6880 
Heurikon (608) 831-0900 
Aeon (505) 828-9120 


Real-time Operating Systems 





Mi C-EXECUTIVE™ (215) 628-0846 
Lynx OS Lynx™ (408) 354-7770 
Wind River Sys. VxWorks™ (415) 748-4100 
Accelerated Tech. Nuecleus (205) 661-5770 
Ready Systems VRTX (800) 228-1249 








©1992 Integrated Device Technology, Inc. 
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THIRD PARTY DEVELOPMENT TOOLS AND APPLICATIONS SOFTWARE =~ 





Type Vendor 
Laser Printer Languages 
Adobe Systems 
Microsoft 


~ Peerless Group 


Compiler Products 
BSO Tasking 
Greenhills 
Support Peripherals 
V3 Corporation 
Chips & 
Technologies 
UNIX Support 
UniSoft 
FDDI Support 


XLNT Designs Inc. 


Product Name 


Postscript™ PDL 
Truelmage™ 
PeerlessPage™ OS 


DRAM/DMA Controller 


RPC 
SVR4 


Plug-n-Play 


7.1 


Phone 


(415) 961-4400 


(206) 882-8080 
(213) 536 0908 


(617) 320-9400 
(805) 965-6044 


(416) 285-9188 


(408) 434-0600 
(510) 420-6400 


(619) 487-9320 








TOOLS: 





Integrated Device Technology, Inc. 


IDT/MIPS DEVELOPMENT 





PRODUCT 
OVERVIEW 


SYSTEMS AND SOFTWARE 





FEATURES 

* Based on Very High Performance MIPS® RISC 
Technology 

Binary compatibility across the complete line of MIPS 
RISComputers™, RISCstations™ and the IDT 
MacStation™ family 

Native Development Environment for MIPS ISA CPUs 
RISC/os™ MIPS SVID compliant UNIX® operating 
system with System V and BSD converged 

World class optimizing compilers including C, Pascal, 
ADA, PL/1, FORTRAN, COBOL 

Networking standards such as TCP/IP and NFS™ for 
connections to SUN, DEC, and other equipment 
X-Terminal support via RISCwindows™ 

Source level symbolic debugging 

Supports a wide range of development tools 

— System Programmer's Package 

— IDT/c including floating point emulation 

— IDT System Integration Manager 

— System modeling tools for performance projections 
MIPS native host platforms include: 

— IDT MacStation 

— Desktop units 

— Deskside units 

— Departmental servers 

— R3000 or R4000 based systems 

Cross development platforms include: 

— Sun-4 

— DEC Vax 


e 


DESCRIPTION 


This data sheet contains an overview of some of the tools 
and support available when developing microprocessor based 
systems using the MIPS development host platforms. These 
systems provide an easy-to-use, robust environment for the 
hardware and software development of today's high-perfor- 
mance RISC based systems, and greatly reduce the time and 
effort required to bring an application to market. 

As in any development environment, ongoing enhance- 
ments are made to enrich the tool set. Detailed information 
on current products and enhancements, new tools, and third- 
party support products is available from your local IDT sales 
representative. 


MacStation ana TargetSystem are trademarks of integrated Device Technoiogy, Inc. 
All other trademarks are trademarks ot their respective companies. 


MIPS DEVELOPMENT SYSTEMS 
CHARACTERISTICS 


MIPS' Development Systems, available from IDT, provide 
a large tool box when developing software and designing 
hardware architectures for any of the MIPS ISA CPUs. Based 
on the UNIX operating system, RISC/os is SVID compliant, 
converging BSD and System V. Because RISC/os is a 
multiuser, multi-tasking operating system, many users can be 
active on the host at the same time and each user may initiate 
multiple tasks. These include simple multi-tasking capabilities 
from print spooling —the ability to queue listings to print while 
still compiling or editing—to more than 200 users all compil- 
ing, editing, linking, etc. 

User protection indigenous to UNIX, easily managed by the 
System Administrator, is a huge asset in developing, testing 
and maintaining systems software. Configuration Manage- 
ment, the job of software management, is much easier with 
these user isolation capabilities. For example, each user can 
belong to a group which is developing a certain section of the 
project. As a user builds and tests a subprogram, he submits 
that tested unit to Configuration Management for archive. The 
librarian can give other users access, but they cannot modify 
the code. This enforces coding rules as code is reviewed 
before it is accepted. The style may seem restrictive, but has 
many benefits: it ensures reliable code inthe library, enforces 
testing procedures, and keeps code stable. 

Many features of standard UNIX operating systems ease 
the burden of program development. VI, the screen editor, 
(instead of a line editor), is a simple and powerful text editor 
that can be used for program entry. It has all the standard text 
editor features, plus an array of advanced capabilities for the 
experienced programmer. Saving and backing up important 
files is easily achieved by use of the TAR command. TAR, 
used for streaming tape backup, lists directories on tapes, can 
extract files from tapes, compare them, and manipulate them 
easily. Lastly, the Makefile command helps compile and link 
many subprograms easily. It is a simple list of all the files you 
need to compile and then link together. If during the debug 
process one subprogram is modified, invoking Makefile will 
find the single subprogram that has to be compiled again (by 
time/date stamps) and then relink automatically. 

The CPU bandwidth required to be efficient at doing 
multitasking is well beyond the capability of today's medium 
range PC or Macintosh computers. Many advances in per- 
sonal computer technology have stretched these single user 
architectures into multiuser domains, butto reduce the latency 
when adapting these systems requires more raw MIPS. 
These MIPS development hosts are thus ideal for the medium 
to large scale development team required for today's high- 
performance RISC-based applications. 
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IDT/MIPS DEVELOPMENT TOOLS: SYSTEMS AND SOFTWARE 


PRODUCT OVERVIEW 





CONNECTIVITY 


The MIPS Development Platforms have been designed to 


easily integrate into existing computer environments. In most. 


cases, these platforms easily integrate and allow the users to 
access the system through their existing terminals, create 
source files with existing terminals for download,.compile and 
link, or more extensively use the systems file and printer 
resources. Solutions to a wide variety of integration goals and 
computing environments are readily available as parts of the 
MIPS Development platforms. 


RS-232: Connections 


The simplest method to interface to the MIPS Spovelanhart 
Platform is to attach RS-232 terminals directly. There are 
typically anumber of direct connections, including Comm for 
the system console (where the system is booted and system 
administration functions are often performed). These terminal 
ports allow a number of different RS-232 devices to be 
connected to the host system, including IBM compatible PCs 
running terminal emulation software or modem connections 
for telephone dial-up. BAUD rates, parity, and other commu- 
nication protocol information are easily configured. 


Network Connections and X-Terminals 

MIPS development systems are capable of integrating into 
all common computer networks, including Ethernet and 
DECnet. If the current environment includes thick or thin wire 
Ethernet the connection is simple, since MIPS systems sup- 
port NFS-Network File System. NFS handles file transfers 
across a network, making all of the file resources of the 
network appear as if they belong in each system on the 
network. Since TCP/IP runs on the system, Ethernet file 
transfer and file sharing is transparent. This allows MIPS 
systems to easily connect to DECstations, Sun workstations, 
and any other Ethernet system using these conventions. 
Using Ethernet, X Terminal support is provided by 
RISCwindows™. This is a MIPS native implementation of the 
standard X Window System™ combined with OSF Motif™. 
Alternately, the RISComm software package can be used to 
integrate the MIPS host into the DECnet computer network. 


PC Compatible Connections 


IBM PCs or compatibles can be connected directly through 
RS-232 or Ethernet. For RS-232 connections, a simple terminal 
emulator like Procomm Plus works well. The programmer can 
create source code using DOS applications and then transfer 
it to the MIPS platform using the Kermit protocol inside of 
Procomm Plus. Alternatively, the programmer can share files 
in the PC and the system by using products from Novell. They 
provide a software tool that runs in both systems allowing the 
user to compile files that exist on the PC, or run application 
programs on the PC and use the file system on the MIPS 
platform transparently. Finally, IDT has produced a cross- 
compiler, including remote symbolic debug, for direct use on 
a PC platform. 


Macintosh 


Several solutions are available to utilize aneiing Macinteeh 
equipment. uShare™ provides a communication network 


| between the Apple Macintosh and MIPS platforms. Once in 


place, it: provides Mac users with server capabilities, elec- 


tronic mail, print spooling, data backup, and terminal emula- 


tion allowing interactive work..This environment allows pro- 
grammers to create code on the Mac using standard word 
processing applications, then transfer the file to the MIPS 
platform for compile and test. Caymen makes The Gator Box, 
which provides a gateway or bridge from Apple-Talk to the 
MIPS platform. Disk sharing and file sharing are acapable with 
this tool as well. 


MIPS OPTIMIZING COMPILERS 


A very important aspect of performance and development 
ease is the compiler technology. Fortunately, this is a funda- 
mental strength of MIPS. 

The MIPS instruction set and compiler technology were 
codeveloped before any silicon was architected. Many years 
were expended developing, tuning, and testing the instruction 
set and the ability of compiler technology to generate efficient 
object code from high-level languages. After this rigorous 
analysis, the silicon trade-offs were made. The result is a chip 
and compiler toolset designed to work with each other; this is 
dramatically different from traditional CPU/compiler develop- 
ment, where a chip manufacturer designs a chip and then 
expects compiler writers to "do their best with it". 

MIPS optimizing compiler technology, widely regarded as 
the bestin the industry, is based on mathematical models that 
work 100% of the time. This is substantially better than the 

"portable compiler" technology used for many other proces- 
sors, which frequently uses heuristic rules for optimization that 
may not result in as robust operation. Today, five levels of 
compiler optimization are available. Each individual level 
further optimizes code size or execution speed. Local optimi- 
zations include: Optimal Calling Sequences, Branch to Branch 
Optimizations, Local Common Subexpression, Schedule FP 
Units, Pipeline Scheduling, Constant Folding, and Code Se- 
lection. Global optimizations, which optimize across proce- 
dure boundaries include: Moving invariant code out of loops, 
Strength Reduction, Register Assignment, Global Common 
Subexpression, Shrinkwrapping, Inter-procedural Register 
Allocation, and Procedure Inlining. : 

The structure of the MIPS compilers exemplifies "State of 
the Art" design; the compiler suite is designed to use a 
common optimizing structure, called the back end, and mul- 
tiple language parsers or front ends. This foundation yields 
many benefits: improvement in the back end improves all 
languages, and more front ends (languages) can be easily 
added at any time and obtain the full benefit of MIPS optimi- 
zations. Also, the structure allows combinations of languages, 
including assembly, to be incorporated in a single program. 
MIPS compiler suite includes C, FORTRAN, Pascal, ADA, 
COBOL, PL/1, assembly. Languages from third party vendors 
are available as well. 

MIPS C cross compiler is also available for the Sun-4 and 
DEC VAX environments, allowing this state of the art optimiz- 
ing compiler to be brought to a cross-development environ- 
ment. 
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TOOLS FOR DEVELOPMENT 


MIPS development hosts include support for both hard- 
ware planning, software development, dnd software integra- 
tion. These tools allow application architects to define a 
system capable of meeting the cost/performance goals of the 
application, allow the software to be developed quickly and 
efficiently, and ease the process of integration of the applica- 
tion software onto the final target system. 

Many of these tools are now available in either Source or 
binary form, and have been re-hosted to the Sun-4 and DEC 
VAX environments. In addition, these tools are available in 
versions to support either R3000 (including IDT R3051 family) 
or R4000-based CPU development. 


Hardware Architecture Evaluation 

The quality of an application system is determined by its 
ability to meet the needs of its marketplace. Good systems are 
well planned, well thought out, well designed, and are main- 
tainable. Tools for hardware architecture definition help mea- 
sure how close the final system will actually come to meeting 
the systems’ predefined goal, before time and money are 
spent on developing prototypes. MIPS tools include the ability 
to accurately gauge the performance of application software 
on a given memory architecture, and allow the hardware 
architect to make cost performance trade-offs in the target 
system even before schematics are fully developed. These 
tools are contained in a product called the System 
Programmer's Package—SPP, and complement many of the 
software development tools discussed below. For example, 
Sable, a symbolic debugger/instruction simulator can be used 
to profile and develop kernel code. Pixie, Pixstats, and Profiler 
work to provide detailed information about the dynamic be- 
havior of the software in the proposed system including 
information about cache performance, "hot spots" and dead 
spots in the code, and the dynamic instruction mix used by that 
software. 


DBX Source-Level Debugger 


This standard debugger for UNIX is excellent for testing 
and finding software problems at the source code level. The 
debugger works for C, FORTRAN, Pascal, and assembly 
language. DBX is an easy tool to use; simply compile your 
code with the -g option (this keeps the symbol table from being 
stripped) and then invoke the debugger by typing “DBX 
command_file_name source_file_ name". The 
command _file_name allows a set of DBX commands to run 
for set up, like setting variables, setting breakpoints, running 
to a specific point and displaying variables. Simple commands 
for DBX are: RUN, LIST, DUMP (list data about current 
procedure), STEP, STOP AT line #, STOP INprocedure_name, 
UP/DOWN (traverse stack activation levels), etc. Complex 
commands like “stop VAR in PROCEDURE if EXPRESSION” 
are also supported. DBX lets you use all the resources within 
the language except coprocessor zero, which can be de- 
bugged using "Sable". 


Both the SPP and the IDT System Integration Manager 
allow the capabilities of DBX to be brought to source level 
debugging on a remote target system. This provides a familiar 
environment to the programmer as the task of integrating 
application software onto the target system is performed. 


Program Behavior Analysis Tools 


_ Pixie is a UNIX program that provides statistics about a 
program's dynamic behavior. One type of data collected 
shows all the called routines, how many times they were 
called, how many instructions per call, and the percent of time 
spent in that call. This information is used to find out where 
most of the program time is spent, allowing the programmer to 
focus on those routines which dramatically affect the perfor- 
mance of the application. Another type of data collected 
shows the locality of the code. This helps to predetermine how 
much cache would be appropriate for this program. It also 
goes into detail on floating point usage, integer multiply/divide 
usage, all possible interlocks, load/branch nops, and more. 
For it to collect this type of data, it actually instruments your 
object code; the instrumented code is executed normally, and 
the output is formatted using the pixstats (pixie statistics) 
program. 

The Profiler is another UNIX tool that further defines code 
behavior. The user can concentrate in specific areas of 
program behavior by focusing the profilers attention to those 
items. Separate lists can be generated covering procedure 
information. They are: sorted by total time spent in each 
procedure, sorted by times called, sorted by number of cycles 
executed, sorted by number of clocks executed, sorted by 
number of clocks per line inside a procedure, and number of 
procedures never called. Again, this tool set allows the pro- 
grammer to tune those areas of the code which will most affect 
the end system performance, thus increasing programmer 
efficiency. 


The System Programmer's Package 


SPP is a toolbox containing programs that help system 
developers to build, test, and download software to the target 
hardware. Among the utilities included in the SPP are the 
following: 
¢ A hardware environment simulator which allows the 

system architect to evaluate the performance of different 

types of memory systems 

¢ A software environment simulator which allows symbolic 
kernel code debug 

¢ Aset of tools for downloading into the target machine, 
and interfacing to DBX to perform remote target source 
level debugging. 

* Common I/O routines and drivers used in many systems. 

SPP is provided in source or binary form and is written in 
C, allowing for easy modification and tailoring. 
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SUMMARY 3 3 ~>— a ee ADDITIONAL INFORMATION 


_ MIPS Development platforms provide all the tools neces- Additional information about specific models and éapablll 
sary for hardware and software system design. UNIX comple- ties is available from your local IDT sales representative. . 

ments the tool set by providing an engineering environment Bae ty : | , os 

and communication capabilities to support a large or small 

engineering team. This environment supports the full devel- 

opment cycle required to be successful with and quick to 

market with epee: based on the IDT/MIPS RISC pro- 

cessor family. 
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TRAINING CLASS 
APPLICATIONS DEVELOPMENT WITH THE 


IDT R3051™/R3081™ FAMILY OF RISControllers™ 








OVERVIEW 


IDT offers a training class intended to provide in-depth 
knowledge on the use and capabilities of the R3051/R3081 
family of processors. The class is intended to provide an 
accurate basis for device evaluation, as well as to provide a 
design engineer with the ability to rapidly bring an R3051/81 
application to production. 

The class is thus intended for engineers who are designing 
with the processor family, and who wish to minimize time to 
market. It is also appropriate for customers performing a 
detailed processor survey prior to device selection. 


COURSE CONTENTS 

The course provides a detailed discussion, including hands- 
on workshops, of both the hardware and software consider- 
ations appropriate to applications development. While the 


course does assume basic familiarity with hardware and 
software development, the course does not assume previous 
RISC training or experience. 

The course prepares the participant to create designs 
around the R3051 family through detailed lecture and work- 
shops. The programming envirnomentis reviewed, as are the 
various hardware price-performance tradeoffs available. 

A detailed outline of the course is contained on the next 
page. | | OS 


COURSE LOCATION AND SCHEDULE 

The course is held on three consecutive days at our Santa 
Clara, California facility. Directions, accomodations, and 
schedule information is available from your local sales repre- 
sentative. 
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CLASS: APPLICATIONS DEVELOPMENT WITH IDT R3051 FAMILY 





 DAY1 


Device Overview — 

¢ CPU Integer Unit ; 

* Floating Point Accelerator — 

¢ System Control Co-Processor 
¢ On-chip Caches 3 

¢ System Interface 


Instruction Set Architecture | 
* Overview a 

* Register Model | 

¢ Instruction Set Details - - 

* Co-processor Operations 


IDT/sim 
¢ Overview 
¢ Commands 


Workshop: Using IDT/sim 
¢ Instruction decoding 

* Command Set 

¢ Program Assembly 


Cache Architecture 
* Cache Architecture 
* Operation 

¢ Flushing 

¢« Performance 


Memory Management 

¢ Overview 

¢ Virtual to Physical Address Translation 
¢ TLB Operation 


Exception Handling 

¢ Precise Exception Model 
¢ Exception Processing 

* Software Techniques 


os Exception Latency 


¢ Special Techniques 


Workshop: IDT R3051 Family Evaluation Board 


DAY 2 


System Interface - 


¢ Operations Priority 


~ ¢ Execution Engine Fundamentals 


*« Read Interface 
* Write Interface 


Workshop: Reading and Writing a Memory Mapped ~ 
Register | | _ 


DRAM Interface | 
* Discrete Implementation Techniques 
¢ The R3721 DRAM controller | 


Workshop: DRAM Control 


Reset/Clocking Interface 
* Mode selectable options 
¢ Input and output clocks 


Workshop: Reset Interface 


High-Level Language Program Development 
¢ Compiler overview 
¢ IDT/c multi-host compiler 


Workshop: Program Development 


DAY 3 


7RS385 Evaluation Board Design Review 
¢ Board Overview | 

* Memory Timing 

¢ I/O Timing 


Floating Point Options 
¢ Hardware floating point 
¢ Software Emulation 


Workshop: Motor Control 


Software Development 

¢ Simulation tools 

¢ IDT/kit 

¢ IDT/sim 

¢ Remote target symbolic debug 


Workshop: Software Development 
Subsystem Products 
* Module Products 


¢ Interface Techniques 


Workshop: Hardware and Software Integration 
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FASTX™ COLOR X-TERMINAL 
REFERENCE PLATFORM 


FOR IDT R3051 FAMILY 





ADVANCE 
INFORMATION 
IDT79S3901 





FEATURES: 





IDT79R3051E/3052E/3081E RlSController™ CPU 
2MB, 4MB, 8MB, or 16MB DRAM in up to 4 SIMM 
modules 

V3 Burst memory DRAM controller 

2MB Video RAM video buffer 

Up to 1MB ROM or EPROM 

Supports up to 25MHz processor-memory interface 
Programmable resolution up to 1280 x 1024 

8 bits of color per pixel 

Programmable refresh rate 

Standard PC/AT keyboard 

Two RS-232C serial ports, up to 19.2Kbaud 

IEEE 802.3 10base5 and 10base2 Ethernet interface 
Serial EEPROM stores configuration settings 
On-board speaker driver port 

Expansion/debug connector 

Interval timer for interrupts from 3.9ms to 500ms 
+5VDc, +12VbDc | | 


DESCRIPTION: 


IDT’s FASTX Reference Platform provides a hardware 
baseline design, complete with X11 software for evaluation of 
cost-performance points and different OEM implementation 
options. The IDT R3051 family offers the widest range of 
R!SController™ cost-performance options based on pin-com- 
patible choices of cache, frequency and floating point options. 

FASTX includes a programmable frame buffer supporting 
resolutions up to 1280 x 1024. Its small size and single 
processor orientation make it an excellent starting point for 
low-cost, high-performance designs. Cost-performance points 
can be easily adjusted by CPU clock frequency, CPU choice 
(cache size options) and the amount of memory installed. 


FASTX Board. Actual Size 10.75" x 8.5" x 1.5" 


The IDT logo ts a registered trademark and FASTX, RiSControiler and R3051 are trademarks of integrated Device Technology, inc. 


All others are trademarks of their respective companies. ; 
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IDT R3051" FAMILY ~ ADVANCE 
LASER PRINTER CONTROLLER ee 
REFERENCE PLATFORM FOR | 1 oD TTIS989 
PostScript” Level 2 SOFTWARE FROM ADOBE® 







Integrated Device Technology, Inc. | 





FEATURES * Programmable DUART eee! with RS232C and Apple- 
; Talk® ports 
¢ Software-ready laser printer controller suitable for Adobe 
a seveloeing Scan Level 2 selec * SCSI Controller (53C80) with one SCSI port (2 connector 
locations) 


IDT/Adobe demonstration platform for PostScript Level 2 
software running on IDT's R3051 RiSController™ family 
IDT/OEM R3041/R3051/R3081 based prototyping target 
and reference design (25MHz) 

Uses IDT79R3721 DRAM Controller and IDT73720 Bus 
Exchangers 

Options for two-way interleaved or non-interleaved 
(jumpers) DRAM memory system 

— Up to 16MB DRAM (four ae sockets; 1 or 4MB 


e 


Centronics parallel input port. 

Adobe reference front panel interface (based on Canon 
LBP-8 MARKIIIR 6-button/LCD/LED front panel) 

IDT FIFO-based Canon video interface to LBP-SX/RX 
engines | 

Clock, reset and interrupt generation logic 

Expansion bus connector for:j | 

— Custom engine interfaces (600dpi, color, etc.) 

— Additional I/O (Ethernet, Adobe FAX, etc.) 


e 
e 


SIMMs) 
—- 4non-interleaved banks or 2 two-way interleaved - Additional font ROM space 
banks ¢ Shipped with IDT/sim™ initialization and monitor debug 


software (PostScript EPROMs available from Adobe) 
EPROM/ROM memory system Executes various Adobe software (provided only under 
— Up to 4MB ROM (8 32-pin sockets; 1, 2 or 4Mb license from Adobe Systems Incorporated), including: 
EPROM/ROMs) — Adobe's high-level and low-level monitors 
— Adobe Print Architecture 
— Adobe's PostScript Level 2 Interpreter 


e 


Options for two-way interleaved or non-interleaved 


- 2non-interleaved banks or 1 two-way interleaved 
bank 
512 bytes serial EEROM 


e 


SCSI 
Parallel Irs ea ne 
Interface .. ab og R3041 
| R3051/52 


or | 
R3081 Video ee 


| LBP. -RX 
| | : | | 
Appletalk 4 eo rin 
Serial | eo ' | | | . Engine 
Interface ; 





Centronics 
Parallel 


Interface Adobe. 


Front 


Panel 
Interface 





RS232C 
Serial 


Interface 





| Expansion Connector | 
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Figure 1. IDT79S389 Block Diagram 


The IDT logo ts a registered trademark and R3041, R3051, R3081, RiSController, !DT/sim are trademarks of Integrated Device Technology, Inc. 
Adobe, the Adobe logo, PostScript, PixelBurst, and the PostScript logo are trademarks of Adobe Systems Incorporated which may be registered in certain jurisdictions 
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IDT79S389 R3051 Family Laser Printer Controller Reference Platform 





INTRODUCTION 


The IDT79S389 provides an R3051 family laser printer 
controller Reference Platform for rapid adaptation into OEM 
differentiated products using PostScript Level 2 software from 
Adobe. "Reference Platform" means that IDT and Adobe 
engineers have jointly developed both hardware and software 
modules for the specified configuration. This provides a 
baseline hardware and software design to accelerate time to 
market where changes can be limited to one or two areas 
(form factor, engine interface or I/O options), without having to 
Start at the beginning. 

Since the IDT R3051 family includes pin-compatible 
members with and without floating point accelerator hardware 
on chip, Adobe software licensees will be able to obtain "core 
PostScript" binaries in two versions: one compiled with the 
MIPS C compiler assuming the presence of the FPA (for 
IDT79R3081), and another version based on IDT's floating 
point emulation libraries (for IDT79R3041, R3051 and R3052). 

The |IDT79S389 is completely self contained, and is intended 
for use either on the desktop, or installed inside a variety of 
Canon print engines; e.g. Canon OEM engines LBP-SX and 
LBP-RX, Canon LBP-8 MARKIIIR and HP LaserJet III. The 
IDT79S389 fits into any of the above engine mounting loca- 
tions, including the standard power supply and video interface 
connections. For evaluation on the desktop, a PC-style 4-pin 
power supply connector is also provided. Figure 1 illustrates 
the simplified block diagram of the IDT79S389 Reference 
Platform. 


The IDT79S389 Reference Platform is designed around 
the R3051 RiSController family, including the IDT79R3081 
and the R3041. All devices in the R3051 family are pin and 
software compatible. As a consequence, R3041, R3051E, 
R3052, R8052E, R3081 and R3081E can be substituted for 
the R3051 throughout this manual. For details on the R3051 
family refer to the data sheets and hardware user manuals. 


SYSTEM OVERVIEW 


Figure 2 illustrates a high-level schematic of the data paths 
and various subsystems of the board. The User's Manual that 
ships with the board provides extensive detail on the board, 
including complete schematics, PAL equations, and theory of 
operation. 


Address and Data Path 

The R3051 family uses a time multiplexed address and 
data bus. The IDT79S389 demultiplexes this bus into an 
address bus and two data buses. The use of two data buses 
both minimizes the loading of the buses, and allows either or 
both of the EPROM and DRAM subsystems to be interleaved. 

The address path Is constructed using IDT 74FCT 162373 
16-bit wide transparent latches, and is de-multiplexed off the 
A/D bus using the processor supplied ALE output signal. 

The data paths are provided by a pair of IDT 73720 Bus 
Exchangers. The 73720 in general is a 3-port, 16-bit wide 
transceiver, used to multiplex a common CPU port between 
two data ports (typically found in two way interleaved sys- 


R3041 
R3051/52 


or 
R3081 


CPU A/D BUS 


ADDR LATCHES 
'373 


CPU ADDRESS BUS 


ADDRESS 

DECODING 
EPROM 
BANK 1 


Z PORT 


ODD DATA BUS 


8530 
SCC 


DMC 
aay SETTINGS 
REGISTER 


| ENGINE 
CONTROL/ 
STATUS 





X PORT 


BUS EXCHANGER 





VIA 
DRAM 6522 
BANKS 1&3 
cerow | | SRONT 


ORING 
DRAM | | 
CONTROLLER REGISTERS 
374 
VIDEO FIFO 
AND 
EVEN DATA BUS VIDEO INTERFACE } 


EPROM CENTRONICS 
BANK 0 REGISTERS | 
DRAM USER 
BANKS 0 & 2 AREA 


Y PORT 
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Figure 2. IDT79S389 High Level Schematics 
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1MB SIMM 


interleaved 


1MB SIMM 
non-interleaved 


0x0090_0000 -> 





Ox009F_FFFF 


Ox00A0_0000 -> 


Kosa OxOOAF_FFFF 
Ox00BO 0000-> . 


Ox00BF_FFFF 





OxOOAF_FFFF - 


tems). The control of the 73720 Bus Exchangers is performed 
by a dedicated PAL, which directs transfers between the CPU 
and the appropriate data bus, and insures that bus conflicts 
are avoided. ar 


State Machines | 

The IDT79S389 uses a distributed state machine structure 
to implement control of the various peripheral subsystems. In 
this structure, each peripheral subsystem has dedicated control 
PALs associated with it. These PALs monitor the start of a 
transaction, and either ignore the transaction (if intended for 
other subsystems), or provide the appropriate control 
responses back to the processor at the appropriate times, 
according to the latency of the targeted subsystem. A master 
PAL generates a common "Cycle End" indicator to all state 
machines, indicating that they can await another transaction. 

The advantage of this distributed state machine structure is 
that memory subsystems can be independently added, 
removed, or modified, without impacting the rest of the system. 
This simplifies end user customization and system debug. 

The disadvantage of this structure is that the number of 
PALs required is larger than if the state machines were 
centralized. . It.is expected that customers using this as a 
reference design would customize and/or cost reduce the 
- state machines and 1/O subsystems, using interface ASICs, 
ASSPs, or condensed PALs. 

Inaddition to the distributed state machines, the IDT79S389 
contains a number of PALs providing common functions to all 
state machines. These functions include address decoding, 
Cycle End generation, data path steering logic, bus timeout, 
and CPU input/response synchronization. 


CPU Subsystem ss : 
The IDT79S389 board incorporates the standard R3051 
family PLCC footprint. It is targeted to run at 25MHz, although 
its frequency may be scaled up or down, as appropriate. Note 
that when scaling frequency, the user should reprogram the 


wait states associated with the various memory and peripheral 


subsystems, and may need or choose to use faster or slower 
control and memory devices. The board and software do not 
require the use of a TLB. | 





0x0080_0000 -> 
0x009F_FFFF (odd) 


-0x00A0_0000 -> | 
OxOOAF_FFFF (even) | 0x013F_FFFF 


0x00A0__0000 -> 
(odd) 


Table 1. DRAM Memory Map 


| Banko. 0x0080_0000 -> 1 0x0080_0000 -> 
| | Ox008F_FFFF Ox009F_FFFF (even) 
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4MB SIMM | 
non-interleaved 


0x0080. 0000 -> 
0x00BF_FFFF 


| 0x00C0_0000 -> 
OxOOFF_FFFF 


0x0100_0000-> — 













4MB SIMM 
| interleaved 
| 0x0080_0000 -> 
| OxOOFF_FFFF (even 


0x0080_0000 -> 
_Ox00FF_FFFF (odd 
0x0100_0000 -> 
0x017F_FFFF (even) | 


0x0100_0000-> 
0x017F_FFFF (odd) | 


DRAM Subsystem | 2 

~ The DRAM subsystem of the IDT79S389 board supports 
the use of 256Kx32 or 1Mx32 72-pin SIMM memories. Up to 
4 SIMMs may be used, for a maximum of 16MB of DRAM 
memory. The memory can be interleaved or non-interleaved, 
according to a set of DIP switches. . 

The DRAM system is controlled by the IDT79R3721 DRAM 
controller. This device features an R3051 family bus interface, 
and implements direct control of the DRAM devices. The 
timing and configuration of the DRAMs is programmable in the 
R3721, according to the settings of an internal mode register. 

To maximize user flexibility without requiring PROM 
changes, the IDT79S389 memory maps a set of DIP switches, 















) 












) 










0x0140 0000 -> — 
0x017F_FFFF 


called the MSEL switches. At system startup, the value of 


these switches is read by the CPU and then written to the 
IDT79R3721 DRAM controller, to configure the system timing 


model. Thus, in order to change the memory configuration or 


timing, the user merely needs to set the DIP switches and 


. reset the board. 


The DRAM memory is memory mapped to the address 
space 0x0080_0000 to 0x017F_FFFF, depending on the size 
of SIMM, number of SIMMs, and interleaving chosen. Table 
1 illustrates the address map, depending on configuration. 
Table 2 illustrates the read and write latency (measured in 
clock cycles) of the various memory configurations, assuming _ 
80ns SIMMs and a 25M system. | 

The IDT79S389 board is shipped with two 1MB 80ns 


interleaved Non-Interleaved 





First Word of Read 
Adjacent words — 


5 
4 
Non-page Write | 4 


Page Write 
Table 2. Number of Clock Cycles for Various DRAM Transfers 








ai See 
ae en 
Poe 





SIMMs in a non-interleaved configuration. Additional SIMMs 
can be added by the user, and interleaving can easily be 
selected. 
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Bank 0 
(non-interleaved) 


Bank 0 0x1FCO_ 0000 -> 
(Interleaved) Ox1FCF_FFFF (even) 
Bank 1 Ox1FCO 0000 -> 





Ox1FCF_FFFF (odd 





interleaved 


PM EPROM | 2MbEPROM | 4MbEPROM 
Ox1FC7_FFFF 0x1FDF_FFFF 
(non-interleaved) Ox1FCF_FFFF 0x1FDF_FFFF Ox1FFF_FFFF 
0x1FCO_0000 -> 

Ox1FDF_FFFF (even) 


0x1FCO_0000 -> Ox1FCO_0000 -> | 
Oxi1FDF_FFFF (odd Ox1FFF_FFFF (odd 














Ox1FCO 0000 -> 
Ox1FFF_FFFF (even) 





Table 3. EPROM Address Map 


EPROM Subsystem 

The EPROM subsystem contains 8 sockets, capable of 
accepting 1Mb, 2Mb, or 4Mb devices. The sockets accept 8- 
bit wide EPROMs in the DIP package. 

The board can be used with either 4 or 8 EPROM devices; 
if 8 devices are used, Interleaved or non-interleaved operation 
canbe selected. The density of EPROM, and the interleaving 
factor, are selected via jumpers and PALs for the board. The 
board ships with 512KB of 120ns EPROM installed ina single 
bank; the EPROMs contain the IDT/sim monitor program 
ported to this board. 

The EPROMs reside in the physical address range 
0x1FCO_0000 through Ox1FFF_FFFF. This address space 
includes the system exception vectors, as well as the bootup 
code, and can be accessed either through or around the on- 
chip processor cache, according to the virtual address used. 
Table 3 shows the physical address map for the EPROMs. 
Table 4 shows the memory latency of the EPROM subsystem, 
for 120ns EPROMs and a 25MHz system. 


SCSI Subsystem 

The IDT79S389 board contains a single SCSI channel, 
implemented using the 53C80 SCSI controller. Although 
there is only one channel, there are two SCS! connectors on 
the board, to support the differences in the form factor of the 
various laser engines supported. 

The SCSI device resides in the address range 0x0074_ 0000 
through 0x0074_FFFF. 


Serial Channels Subsystem 

The IDT79S389 board implements two serial channels. 
One is a traditional RS-232 channel, and is accessed by a DB- 
25 connector. The other channel supports AppleTalk, and 
uses the standard AppleTalk connector. The board includes 
voltage translators and transceivers to implement the electri- 
cal protocols required by these standards. 







ee Interleaved Non-Interleaved 
| First Word of Read 1 5 | | S 
| Adjacent words | a 1.6 (1-3-1) ! 


Table 4. Number of Clock Cycles for Various EPROM Transfers 







The serial channeis are implemented using a single 85C30 
SCC serial controller. The address space for the serial 
controller is 0x0073_0000 through 0x0073_FFFF. 


EEROM Interface 


The IDT79S389 board includes a 512B EEROM to store 
various configuration data. The EEROM Is accessed by the 
65C22 VIA device, which is memory mapped to 0x0071_ 0000 
through 0x0071_FFFF. 


Centronics Interface _ 

The board also includes a unidirectional Centronics port. 
Centronics data is read from address space 0x0075 0000 
through 0x0075_FFFF; Centronics status is written in the 
address space 0x0076_0000 through 0x0076_FFFF. 


Front Panel Interface 

The front panel interface corresponds to a Canon LBP-8 
Mark I|IR, and uses a series of switches, LEDs, and LCDs to 
implement front panel control. Front panel is accessed by the 
65C22 VIA device, which is memory mapped to 0x0071_ 0000 
through 0x0071_FFFF. 


Video Interface 

The video interface corresponds to the interface requires 
for the Canon LBP-8 Mark IIIR, based on the Canon LBP-RX 
print engine. The video interface is implemented using discrete 
logic, with status taken from the 65C22. 

Video data is sent to the video interface by performing an 
aliased read of the DRAM memory. If a processor read of the 
16MB region starting at 0x0880_0000 is detected, the access 
will be processed as a DRAM read. However, the read data 
returned from the DRAMwillbe captured by the video interface, 
and later shifted out to the print engine. This technique 
eliminates overhead by not requiring the processor to explicitly 
write the data to the video channel. | | 


User Expansion Area 

In addition to the memory systems described above, the 
IDT79S389 board contains a user expansion connector. The 
user expansion connector allows users to add custom features 
to the board for software development. Features which could 
be added might include an Ethernet channel, additional font 
ROM, or a different engine and front panel interface. 

The IDT79S389 board provides a User Chip Select, mapped 
to address 0x0078 0000 through 0x0078_FFFF, for use with 
the expansion connector. 





7.5 





IDT79S389 R3051 Family Laser Printer Controller Reference Platform 





Board Form Factor 

The form factor and hole placement of the board allows it 
to be directly mounted into either a Canon LBP-8 Mark IIIR 
laser printer, the HP LaserJet Ill, or the Canon OEM print 
engines LBP-SX or LBP-RX engines. The placement of the 
video, front panel, and power connectors, are compatible with 
these form factors. | 

In addition, the board can be run on a benchtop using a 
standard PC compatible power supply. If the board is used in 
this fashion to drive an engine, it is recommended that a 
common ground between the board and the engine be provided. 


Summary: Address Map and Interrupt Assignment 


Table 5is asummary of the address map of the IDT79S389 
board. Table 6 shows the interrupt assignments of the CPU. 


Memory Subsystem | Start Address | End Address 
0x0071_0000_| 0x0071_FFFF 
Isso 0x0073_0000 | 0x0073_FFFF 






















DRAM | 0x0080_0000 | 0x017F_FFFF 
Aliased Video DRAM | 0x0880_0000 | 0x097F_FFFF 
EPROM Ox1FCO_0000 | 0x1FFF_FFFF 


Table 5. IDT79S389 Memory Map Summary 


| R3721 Mode Register | 0x007A_0000 | O0x007A_FFFF 











| Device | 
| Reserved | nt) 
VA 

@) 

(4 


5 15 


HFullVideo Reset 


=i 


| SCSI | 


Table 6. IDT79S389 Interrupt Assignment 


SCSI port: 


CPU Interrupt = 


SPECIFICATION SUMMARY 


Order Number: ——‘ 1DT79S389 


Maximum on board memory capacity: 


DRAM Four 72-pin SIMM sockets for 
03 256Kx32 or 1Mx32 (1 to 16 MB) - 
EPROM Eight 32-pin sockets for 128Kx8 
; | to 512Kx8 (to4 MB) | * 
Serial EEROM One 8-pin socket for serial 


EEROM (512 bytes) | 


Debug Monitor EPROM: 
IDT/sim Version 4.0 ported to 
IDT79S389 | | 


Serial Ports: 
Serial 


Controlled by 85C30 DUART. 
CRT terminal connector or for 
downloading J3 (25-pin AMP 
7481 33-1,DB25S, right angle - 
female). 

AppleTalk connector J2 (8-pin 
AMP 749179-1, D8 8, right angle 
female). — - 


Appletalk 


Parallel port: _ 
Centronics — 36-pin, female, right angle, 
standard Centronics parallel 
connector (R.Nugent RPM- | 


C36SB-SR-TG). 


Controlled by 53C80 SCSI 
controller. SCSI connector J5 or 
J10 (50-pin, female, right angle, 
(R.Nugent RPM-C50SB-SR-TG). 


Standard 20-pin, male Canon 
LBP-RX video interface 
connector (HIROSE PCN-10-20P 
2.54DSA). | 


Video: 


J7, 34-pin male, right angle 
connector (AMP 1-103149-7). 


Front Panel: 


Four 40-pin male, four wall 
headers, J11-14 
(MOLEX 39-26-7404). 


Expansion: 


‘Compatible with Canon RX, SX 
— engine form factors. 


Physical: 


Operating Temp:  0-50°C. 


5.0V +- 5%, 3 Amps typical 
(estimate). 


Power Supply: 
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PHYSICAL LAYOUT | 3. Advanced hardware starting point for rapid evaluation, 
The physical layout of the IDT79S389 Reference Platform cost-performance point analysis and development of 
OEM finished products. 


reflects the board’s primary objectives: - 
4. Advanced software-ready controller, suitable for 


1. Software delivery vehicle for PostScript Level 2 ; 
eatiware from Adobe immediate development with PostScript Level 2 
— Memory space appropriate for PostScript Level 2 software from Adobe, and adaptation to other print 
software typical implementations, engines and communications ports (Adobe software 
— Various memory configurations (interleaved available only under license from Adobe Systems 
vs non-interleaved, code running out of DRAM or Incorporated). 
_ outof ROM) to easily evaluate cost and performance 
alternatives. 
2. Cost-Effective design model for IDT79R3051 
RlSController family 
— NO zero-wait-state memory, 
— Minimum complexity board configuration (6 layers), 
— Fits industry standard Canon LBP-RX print engine. 
SCSI Connector Locations 3051 or R3081 Centronics 


(FP lib or hardware) 


RS232C 


EPROM/ROM Sockets RESET 
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Figure 3. IDT79S389 Board Layout 
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IDT79R4000 FLEXI-CACHE™ 
DEVELOPMENT TOOL 





| Integrated Device Technology, Inc. 


FEATURES: 

Hardware Development Tool for implementing various con- 
figurations of the secondary cache requirement for the 
IDT79R4000 CPU 

Configurable in various cache sizes, number of words per 
line size, and split vs. unified cache operation 

Move from prototype/development to production with no re- 
design by using pin compatible “production grade” 
IDT79R4000 secondary cache modules 

Development module operating frequencies to support zero 
wait-state 50MHz IDT79R4000 operation 

Four identical 80 lead gold-plated SIMMs (Single In-Line 
Memory Modules) support each IDT79R4000 CPU 
Surface mounted plastic components on a multilayer epoxy 
laminate (FR-4) substrate 

Multiple ground pins and decoupling capacitors for maxi- 
mum noise immunity 

TTL compatible I/Os 

Single 5V (10%) power supply 


e 


DESCRIPTION: 


The IDT7MP6048/7MP6068 is a Hardware Sadeloeniekt 
Tool used for implementing various configurations of the 
secondary cache requirement for the IDT79R4000 CPU. By 


~ PRELIMINARY 
IDT7MP6048 
IDT7MP6068 


changing jumpers on the modules, the designer can easily 
change certain characteristics (cache size, number of words 
per line, and split vs. unified operation) of the secondary cache 
in the lab. By running benchmarks on the actual system using 
these various cache configurations, the secondary cache 
which best optimizes system performance can be deter- 
mined. This development tool gives you cache performance 
benchmarks which are superior to benchmarks derived via 
simulation. 

Move from development to production without changing 
the secondary cache footprint by choosing pin compatible 
“production grade” IDT79R4000 secondary cache modules. 
These high performance, high density IDT modules are opti- 
mized to meet the customers’ exact cache requirements 
required for volume production of the system (please consult 
the factory for more details). 

The IDT7MP6048 is a 1MB secondary cache module block 
(four identical modules builds a complete cache to support 
each IDT79R4000 CPU) constructed on a multilayer, epoxy 
laminate substrate (FR-4) using 11 64K x 4 static RAMs and 


FBT logic drivers while the IDT7MP6068 is a 4MB secondary 


cache module block using 11 256K x 4 static RAMs and FBT 
logic drivers. Extremely high speeds can be achieved using 


high-performance BiCMOS IDT61B298 or IDT71B028 static 


RAMs and IDT74FBT2827 drivers. The FBT drivers have 





FUNCTIONAL BLOCK DIAGRAM) 


A1-17 a 7 
OE — 
TCS 

DCS 


74FBT2827 
x2 


NOTE: 


64K x 36 
or - 
256K xX 36 - 
DATA 





i I/O0-35 ‘ | T0-7 


2841 drw 01 


1. The Data and Tag sizes shown on the block diagram are only for the case when the jumpers are in the default positions for the respective modules. These 
sizes will change according to the jumper connections (see Jumper Connections on page 2). 


The IDT logo is a registered trademark and FLEXI-CACHE is a trademark of Integrated Device Technology, Inc. 


COMMERCIAL TEMPERATURE RANGE 


©1992 Integrated Device Technology, Inc. 


JUNE 1992 


7.6 DSC-7093/1 
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IDT79R4000 SECONDARY CACHE MODULES 


COMMERCIAL TEMPERATURE RANGE 





BICMOS 1/Os and internal 25Q series output resistors result- mance critical part of the system. All inputs and outputs of the 
ing in the fastest propagation times with minimal overshoots modules are TTL-compatible and operate from a single 5V 
and ringing. Multiple GND pins and on-board decoupling supply. Fully asynchronous circuitry is used, requiring no 
capacitors provide maximum noise immunity for this perfor- clocks or refresh for operation of the module. 





CACHE CONFIGURATIONS") 


Size line 
a ae 


NOTE: , 2841 tbl 01 
1. Please refer to the Jumper Connections for instructions on how to implementthe Cache 
Configurations shown above. 


JUMPER CONNECTIONS: 

















Cache 
Operation 
unified cache (default 
split cache 











~4MEG SPLIT OR UNIFIE 
| (7MP6068 DEFAULT) 





— OO hm O Oo @=@ eee O oO oO lh oO O fam O 
P3 6 P6 P3 65 P6 P3 5 P6 P3 | P6 
cI CO _—£ O oO OC i O oO ol CU oO OC fame O 
P2 ; P5 oO P2 p | P5 © P2 p § P5 © P2 4 P50 
| ne? © om ra CO | ([ibaweeeg) CJ Cc Gu=™® 
P 1 , P4 © P1 f P46 P1 5 P40 P1 d P40 
1 MEG UNIFIED 1 MEG SPLIT 512K UNIFIED 512K SPLIT 
(7MP6048 DEFAULT) 
oO OC i O oO OC =e O oO Oo hm O 
P3 P6 P3 | P6 P3 g PE 
oO f Oo tm O oO olUCKd 6h Ome oO o a ro) 128K SPLIT 
P2 P50 P2 ; | P50 po db P5 5 | NOT SUPPORTED 
Oo Dp oO ro) oO oO ro) oO oO ro) BY R4000 
Pi ob Pa ob pid Pa ob pid vee | 
256K UNIFIED 256K SPLIT 128K UNIFIED 
Cache line size is controlled by Jumpers P7-P9 as follows: 
mn ic) ic) oO r 
P9 O P29 Oo} PIO - Pg. 
a Gi oO g oO r 
P8 © P8 Oo Ps P8 
P7 O P7 f P7 P7 § 
4 WORDS/LINE 8 WORDS/LINE 16 WORDS/LINE 32 WORDS/LINE 
(DEFAULT) 2841 drw 02 
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PINCONFIGURATION™) = -— PINNAMES — 


Data Inputs/Outputs 




































~ Vec | GND | 
VOs es Address Inputs 
Os VO4 — — | 
Os tee | TS | sTagChipSelect 
eon 014 , = _| Write Enable | , 
yOr4 VOrs Output Enable | 
017 VOr6 3 | 
VOr19 /O18 GND Ground | ! 
/O21 O20 . 2841 tbl 03 | 
GND 1/022 
O23 Vcc . 
O25 es CAPACITANCE | 
26 | 
VOu i Symbol] Parameter” | Conditions [Max. | Uni 
O20 GND Input Capacitance (Data) 
/O32 VOs1 Cin(A) | Input Capacitance __ VIN = OV 10 | pF 
(A1-15, OE, TCS, DCS) — 
034 1/033 
035 CIN(B) Input Capacitance VIN = OV 100 | pF 
GND Ty Yiad (Ao, WE) © 7 | | 
Ao WE we) 
A2 At Output Capacitance VouT = OV 
Aa . NOTE: 2841 tbl 04 
Aéc 2 1. This parameter is guaranteed by design, but not tested. 
Vcc GND | 
OE DCS 7 | 7 
As Ar , . 
roe Ag | RECOMMENDED DC OPERATING 
GND CONDITIONS 
12 
A oe Symbol] Parameter 
A17 ail Supply Voltage 4.5 
i | | Supply Voltage 
1 GND | Supply Voltage 7 
+3 Te Input High Voltage | 2.2 
Ts = Input Low Voltage | -0.5!7) 
” - NOTE ; 
V : 2841 tbl 05 
GND 1. Vit =-—1.5V for pulse width less than 10ns. 
SIMM 2841 drw 03 
TOP VIEW 


ABSOLUTE MAXIMUM RATINGS 









NOTE: 
1. For proper operation of the module, please refer to the Jumper Connec- 
tions for proper connections of the module pins. 


Symbol | Rating” | Value | Unit 


VTERM Terminal Voltage with Respect |-—0.5 to +7.0| V 
to GND | 













OQ 


TA Operating Temperature 0 to +70 
TBIAS Temperature Under Bias —10 to +85 
TsTG Storage Temperature —55 to +125 | og | 


| lout = | DC Output Current | 50 | mA | 


RECOMMENDED OPERATING NOTE: aren 
TEMPERATURE AND SUPPLY VOLTAGE | 1. Stresses greater than those listed under ABSOLUTE MAXIMUMRATINGS 


Ambient may cause permanent damage to the device. This Is a stress rating only 
| Grade Temperature GND _. Vcc 
| 


| and functional operation of the device at these or any other conditions 
| Commercial | O°Cto+70°C | BV + 10% 


above those indicated in the operational sections of this specification is 
2841 th! 02 












not implied. Exposure to absolute maximum rating conditions for ex- 
tended periods may affect reliability. 
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DC ELECTRICAL CHARACTERISTICS 
(Vcc = 5V + 10%, TA = 0°C to +70°C) 


Symbol Test Conditions | Min. | Max. | Unit_ 
Input Leakage (except Ao, WE) | Vcc = Max., VIN = GND to Vcc ae re 
[IL12| Input Leakage (Ao, WE) Vcc = Max., VIN = GND to Vcc es 110 | WA 
















21 fe 
>i> [> 


| 10 | yA | 
| 2200 | mA | 
[VoH__| Output High Voltage | Voc = Min, lon=tmA ek | 
Voc = Min,, jo. = 8mA Paes 


2841 tbl 07 


3 
> 





AC TEST CONDITIONS 


Input Pulse Levels GND to 3.0V 
input Rise/Fall Times 







Input Timing Reference Levels _ 
Output Reference Levels 
Output Load See Figures 1—4 
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+5V +5V 


4800 4800 


DATA out DATA out 


2841 drw 04 2841 drw 05 


Figure 1. Output Load Figure 2. Output Load 
(for toLz and touz) 


* Including scope and jig. 





ATAA 
(Typical, ns) 
DATAOUT 
50Q 
1.5V 
2841 drw 06 
20 40 60 80 100 120 146 160 180 200 
CAPACITANCE (pF} 
2841 drw 07 
Figure 3. Alternate Output Load Figure 4. Alternate Lumped Capacitive Load, 


Typical Derating 
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AC ELECTRICAL CHARACTERISTICS 
Vec = 5V + 10%, TA = 0°C to +70°C 





Parameter 
| READ CYCLE 


Address Access Time 


tAA. | 
tA0A 

toe 
ton” 


to.z) | Output Enable to Output in Low-Z | 2 


WRITE CYCLE 
AW 
taoW 
WP Write Pulse Width 
| tow Data Valid to End of Write 
tDH Data Hold Time 


NOTE: 2833 tbl 08 
1. This parameter is guaranteed by design but not tested. 


TIMING WAVEFORM OF READ CYCLE“) 


ADDR _ ADDR VALID 


TAA 


_ 


DATAOUT =. DATA VALID 


2841 drw 08 


NOTE: 
1. This parameter is guaranteed by design, but not tested. 


TIMING WAVEFORM OF WRITE CYCLE 


ADDR og ADDR VALID 


> 
= 


twp 


tDW tDH 


DATAIN | DATA VALID | 


i 


2841 drw 09 





IDT7MP6048, 7MP6068 (DEVELOPMENT KIT) 
IDT79R4000 SECONDARY CACHE MODULES COMMERCIAL TEMPERATURE RANGE 





PACKAGE DIMENSIONS 





EXPLODED VIEW OF 
JUMPER AREA 


P352° pgeeec pgs 
















Pe. eee P5 o2° 








8g | 0.045 
0.055 











0.240 PIN1 0.050 SIeWwiEW 
0.260 FRONT VIEW TYP. , 
ea feng eu | aah 
onoono00on 
1°) ce) °o 
ono00n0 OO 
°o ] ° 
BACK VIEW 
PIN 14 2841 drw 10 
ORDERING INFORMATION 
IDT XXXX X XXX X X 
Device Power Speed Package Process/ 
Type Temperature 
Range 
en Blank Commercial (0°C to +70°C) 
M FR-4 SIMM (Single In-line Memory 
Module) 
12 
15 
A Speed in Nanoseconds 
25 7 
30 
S Standard Power 


7MP6048 1MByte IDT79R4000 Secondary CacheModules 
7MP6068 4MByte IDT79R4000 Secondary CacheModules 





2841 drw 11 
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R3051™ FAMILY #8 ©” ~~ a IDT79S385A. 


EVALUATION KIT 





Integrated Device Technology, Inc. 





FEATURES: | ¢ IDT's System Integration Manager included in 
- Complete 25MHz RISC System Board EPROM 
— Requires only 5V supply and terminal to — High capability debug monitor 
operate | — Simplifies software development 
— Supports R3041, R3051, R3052, or R3081 | 
highly integrated RISC CPUs ¢« Complete set of documentation included 
— Board contains anIDT 79R3052E . === =  — Supplied with complete set tof board 
— 1MB of non-interleaved DRAM, expandable tS schematics : 
to 4MB mem PAL equations supplied on IBM PC 3.5" 
— 128KB of EPROM, expandable to 2MB disks 
— Serial and Parallel Ports es “5 User's manuals for R3051 family, IDT/sim, 
— Connectors provided foreasy connectionto = = and IDT/c 
HP Logic Analyzer | 
— Wire-wrap area on the board ¢ Utility programs also included 
— Program utility disk 
- IDT/c for IBM PC compatibles included in kit — HP16500A Logic Analyzer disassembly 
— Hardware or software floating point _. . Software 


— Remote symbolic debug | | 
- R3081 sample also included for board upgrade 








'385 RISC System Board. Actuai Size 8.5" x 11" 


The IDT logo ts a registered trademark and R3051, R3041, and R3081 are trademarks of integrated Device Technology, Inc. 
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IDT79S385A 
RISC EVALUATION KIT FOR R3051 FAMILY 


DESCRIPTION: 

The IDT79S385A Evaluation Kit is a complete kit for 
evaluating the R3051 hardware and software environment. 
The kit contains a working system, including all schematics 
and theory of operation, an R3081 sample to allow the user 
to upgrade the system capabilities, and a complete software 
development environment, including debug monitor and "C" 
compiler/assembler toolchain. Finally, the kit is comple- 
mented by documentation, logic analyzer software, and 
utility programs. 


COMPLETE SINGLE BOARD COMPUTER 

The '385 board is a complete working RISC system 
intended as a complete design example using the R3051 
family of highly integrated RISC CPUs. The board requires 
only a simple CRT terminal and a 5V power supply for 
operation. Figure 2 shows a block diagram of the '385 board. 

The board is designed around IDT’s R3051 family of 
highly integrated RISC CPUs. An R3052E CPU chip (8KB 
l-cache and 2KB D-cache, with on-chip TLB) is included in 
a socket, but any member of the family can be substituted. 
The 79S385A kit includes a sample of a 25MHz R3081 ina 
PGA pinout, to allow the user to upgrade the system. Alarge 
wire-wrap area is available on the board for adding addi- 
tional hardware. All the schematics and details of the 
designs are supplied with the board, including all PAL 
equations on an IBM format 3.5" disk. 

The '385 board is supplied with 1MB of DRAM in socketed 
256Kx4 ZIPs; the ZIPs can be replaced with 4 megabit 











Signal Name 
EA00-EA31 

| EDOO-ED31 
SYSOUT 
MRES# 
MREQ 
EXACK# 
IP4-IP5 
WEA-WED 
CS 
R# 
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INTO:INTS 


| BREQ# 
| BUSGNT# 
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fe 
[v0 | sebibutecdaiesns OOSC“‘“SSSS 
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devices to obtain 4MB of DRAM on the board (an applica- 
tions brief on upgrading memory is included in the kit). Other 
hardware on board includes a 2681 DUART and an 8254 
counter/timer; both these devices are supported with drivers 
in IDT/sim. A parallel Centronics port is available for higher 
speed download of code into the board. 

The board contains 128KB of EPROM expandable to 
2MB by replacing the EPROMs with higher density devices. 
The EPROMs contain IDT’s powerful System Integration 
Manager (IDT/sim), a debugging monitor that supports 
download of code from host systems, execution control 
commands, memory probing, and I/O. 

There are two serial ports, a free-running programmable 
timer, and a parallel Centronics port for high-speed down- 
load of software. A set of expansion connectors permits 
external hardware to be connected to the board, and a wire- 
wrap area on the board can be used to build additional 
hardware without using a second board. 

The board is designed to be placed on a flat table-top 
surface. Standoffs are provided for physical support. 

The 3051 Bus, along with other control signals, is con- 
nected to a set of pins in the center of the board next to the 
wire wrap area. These signals can be used to connect 
additional hardware on either the wire-wrap area or on 
another board via a ribbon cable. DMA control is 
provided. Table 1 shows the signal description for the expan- 
sion connector. 
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Table 1. Signals Supplied on Expansion Connector 
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** These control signals include R3051 and the on-board control logic signals as well. 


Figure 2. '385 Board Block Diagram 


IDT/SIM DEBUG MONITOR SOFTWARE 

IDT’s System Integration Manager (IDT/sim) is included 
in EPROMs on the board. This software permits download- 
ing of code from a host system, execution control with 
breakpoints, in-line assembly and disassembly, and a vari- 
ety of commands to control main memory, cache memory, 
and the internal TLB. It provides all the resources needed 
to bring up new hardware and software. 

The evaluation kit also includes a complete set of user 
documentation for the IDT/sim software tool. The capabili- 
ties of IDT/sim are described in a separate data sheet. 
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IDT/C "C" COMPILER FOR IBM PC COM- 
PATIBLES 


In addition, the evaluation kit contains a complete copy of 
the IDT/c software development toolchain, hosted on IBM 
PC compatible computers. IDT/c, described in a Separate 
data sheet, includes: 

The ability to generate big- or little-endian code’ 
Hardware or sofiware floating point support 

"C" library support, including source libraries 
Remote symbolic debug 


e 


IDT79S385A 
RISC EVALUATION KIT FOR R3051 FAMILY 


LOGIC ANALYZER INTERFACE 

The 79S385A evaluation kit also includes the ability to 
simply use an HP16500A logic analyzer for execution trace 
and software debug. The board includes a set of connectors 
to easily allow connection of the logic analyzer to the board. 
Also included is a disk for the HP16500 containing disas- 
sembly software, allowing the analyzer to display a disas- 
sembled listing of the software executing on the system. 


KIT SUMMARY 


The IDT 79S385A evaluation kit is a complete low-cost 
package for evaluation of the R3051 family, especially its 
software environment. The kit allows the user to develop 
and execute high-level language programs, to look at a 
software development toolchain for the IDT R3051 family, 
and to evaluate a hardware design around the R3051 family. 


KIT CONTENTS 
'385 RISC Evaluation Board 
IDT/c Multi-Host compiler toolchain for IBM PC com- 
patibles 
IDT/sim debug monitor included in board EPROMs. 
User's Manuals for: 
'385 board 
R3051 
R3081 
IDT/sim 
IDT/c 
Applications Guide for R3051 family 
Program Utility Disk 
Disassembler for HP16500 Logic Analyzer 
PAL Equations on IBM PC compatible 3.5" disk format 
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BOARD SPECIFICATIONS 


CPU 
25MHz R3052E on board 
25MHz R3081 sample included 


Cache Ram 
8KB |-cache, 2KB D-cache (in 3052 chip) 
16KB |-Cache, 4KB D-Cache configurable to 
8KB |-Cache, 8KB-DCache (in R3081 chip) 


Cacheable Address Space 
4GB 


DMA Support 
Bi-directional tri-stateable buffers can be used to 
write to DRAM from external logic 


Block Refill 
4 word instruction block size 
1 or 4 word data block size programmable via jumper 


Endianness (Byte Ordering) 
User programmable via jumper 


Read/Write Buffers | 
Both are 4 words deep (inside 3052 chip) 


Interrupts 
6 User Interrupts, three synchronized with SYSCLK 


1/0 characteristics 
TTL levels from FCT logic devices, PALs and R3052 


Power Supply 
2 amps (typical) at 5V, 25°C, at rated speed 


Environmental Conditions 
Ambient temperature 0°C to +50°C 
Relative Humidity 5% to 95% 





Clock Frequency 
25MHz 


Interconnection 
Five 50-pin connectors, containing Address, Data, and 
Control signals and R3052 signals 
Five 20-pin plugs for use with HP logic analyzer 
Two RS-232 serial ports on DB-25 connectors 
One parallel Centronics port for input 


User Selectable Options 
Endianness, data block refill size 
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ORDERING INFORMATION | 2% 7 eras : 

Each unit is shipped with complete schematics and PAL equations. A -user's manual includes. instructions on 
downloading code, operating the Software Integration Manager, and providing the correct timing interfaces to additional 
hardware. Boards are shipped with the 3052E CPU plugged in, but any member of the 3051 family can be used. An 
additional sample of the R3081 is included to allow user upgrades. 


Evaluation Boards | | : 
RS0S1-Familly Evaluation Kit 5 5.csiatcseh Becieicateceecctenteetterl teats cts spests lead acacia tn aaa ata veeseee T9S3B5A 


EPROM Upgrades 
The following part numbers update the evaluation board hardware to the latest version of the IDT/sim 
monitor. 
EV eI ARLOIA: CANS oes de caeace tc eds ics cies Peladald ones ceceticcmen bd sductedeScuncsenaatcasswaticaudaccisahusesmateted aa emaaties 7RS901BGP 


Use with 79S385 only 


Auxillary Download Programs 
For downloading code from a MIPS machine into an evaluation board. This software includes pro- 
grams to convert MIPS object code into S-records and to download either ASCII or binary S-records to 
a remote target. This software is only needed if you are running the MIPS C-compiler and do not have 
SPP. If you are using IDT/c-or you have IDT/sim or MIPS SPP you already have these utilities. 


MIPS download utility........ atau vetilade aes cba se suabcadeeup eldacwad dacs cost cueeunpadvaeusbiamudiede tn aceavducxcrerexaceenubtgotabeseutiess 7RS950BUU 
Supplied on QIC-24 TAR tape 
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IDT/sim™ 


NEW! Version 4.0 IDT7RS901 


SYSTEM INTEGRATION MANAGER 
ROMable DEBUGGING KERNEL 





Integrated Device Technology, Inc. 


FEATURES: 
¢- Complete Source Code Provided 


¢ Robust Debug Monitor 


¢- Supports Source Level Debug 
DBX - MIPS Tool Chain 
IDB - IDT/c™ Tool Chain 


- Remote File Access - Connects Target & Remote 
Host 


¢ Diagnostic Tests for Memory, Cache, MMU, FPU, 
and System 


¢ Adaptable to Systems With or Without Hardware 
Floating Point Accelerator 


- Includes Variety of Device Drivers 
« Easy to Add New Commands and 1/O drivers 


FOR R3000 ISA CPUs 


POWERFUL TOOL FOR INTEGRATION OF 
SYSTEMS BASED ON R3000 ISA CPUS: 


The IDT7RS901 System Integration Manager (IDT/sim™) 
is a ROMable software product that permits convenient 
control and debug of RISC systems built around R3000 ISA 
CPUs (R3000, R3001, R3500, 3051 Family, 3081 Family). 
Facilities are included to operate the CPU under controlled 
conditions, examining and altering the contents of memory, 
manipulating and controlling R38000 resources (such as 
cache, TLB and coprocessors), loading programs from host 
machines, and controlling the path of execution of loaded 
programs. 

IDT/sim source code includes IDT’s MicroMonitor, a very 
simple monitor for performing the initial debugging of new 
hardware. 

IDT/sim requires 115KB of EPROM space for code and 
data, and 71KB of RAM space for uninitialized variable data 
and stack. 










| HOST 


SYSTEM 
( MacStation™, MIPS, Sun, or PC ) 


& REMOTE DEBUG) 


RS232C CABLE 
(DOWNLOAD 






oe 
ieee 


SYSTEM UNDER 


| DEVELOPMENT | | 


TARGET WITH IDT/sim in EPROMs 
System Integration Manager 


The IDT logo is a registered trademark and IDT/sim and IDT/c are trademarks of Integrated Device Technology, Inc. 
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IDT/SIM FEATURES 


IDT/sim is a software tool to help system designers debug 
hardware designs and port software to systems based on 
one of the R3000 ISA CPUs (R3000, R3001, R3500, 3051 


Family, 3081 Family). The software is supplied in EPROMs | 


on most IDT RISC SubSystem Development products, and 
may be purchased in source code form so it canbe compiled 
and installed on your system. 

_ IDT/sim provides all the basic functions needed to get a 
new hardware design debugged and to port and debug 
software onit. Typically, the monitor is compiled and burned 
into EPROMs that are plugged into the target system. 
Approximately 115KB of EPROM are needed for the binary 
code, and 71KB of RAM are needed for storing variables. 
Once installed; the designer communicates with the monitor 
via a simple terminal connected to an RS-232 port on the 
target system. Source code is included to support a variety 
of UARTs for this port. On start-up, the monitor will deter- 
mine the cache and main memory sizes automatically. 


Diagnostics 
The monitor includes a set of diagnostic routines for 
testing the integrity of the hardware. 


Main Memory Test: Finds opens, shorts, and stuck-at 
faults on data and address lines. A cache memory test 
runs memory tests on both caches, checks tag memory, 
and verifies that instructions can be executed from 
cache. 


System Test: Checks the ability to read and store full 
words, half words, and bytes. Checks cache operation 
for valid, hit/miss, and invalidation. 

MMU Test: Checks operation of TLB inside the R3000. 


Floating Point Test: Tests the functionality of the R3010 
FPU, including exception interrupts. 


Download Support 

Object code created on a software development system 
can be downloaded in either ASCII S-records or binary 
formats to the target system’s memory. The code can be 
produced with the MIPS development tools or with IDT/c on 
any of anumber of development platforms: MIPs, Sun, IDT's 
MacStation, and 386/486 PCs under Xenix or DOS. 

IDT/sim source code includes utilities to convert object 
code from the MIPS compiler to S-records, to convert the S- 
records to a binary format (which is more compressed and 
downloads faster), and to download the binary records to 
the target. Similar utilities for use with the IDT/c multi-host C 
compiler are supplied with IDT/c. » 

A terminal emulation feature allows the terminal, used as 
the IDT/sim console, to also be used as a terminal to a 
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software seveloonied system accessed hemigh a second 
serial port. This mode supports remote file download. 


‘Debug Commands 


There are a variety of commands included in IDT/sim to 
support software/hardware debug. 


Execution Control: Breakpoint, call, continue, go, gotill, 
next, step, unbreak. 


Memory Commands: Assemble, cache flush, 
compare,disassemble, dump, dump cache, dump 
registers, fill, fill registers, move, read/write cache, search 
and substitute. 


TLB Commands: Dump, flush, map, pid and probe. 


Remote Debug: Source level debug with DBX on a MIPS 
RISC/os system and with IDT Cross development ‘c’ 
compiler tools. 


Communications: Remote file access, terminal emulator 
and set baud rate. 


Run-Time Support 

IDT/sim includes over 40 functions that can be called by 
user's programs to perform common I/O and R3000 control 
operations. A complete list of the commands is listed later 
in this document. 


NEW FEATURES IN VERSION 4.0 


IDT MicroMonitor: IDT/sim includes IDT’s MicroMonitor, a 
very simple monitor for performing the initial debugging of 
new hardware. The only hardware which must be function- 
ing to run the MicroMonitor is the CPU, EPROM and a serial 
port function. This allows for immediate debugging of 
hardware even when the DRAM memory is not functioning. 


Source Level Debug: Supports source level debug using 
either IDT/c or the MIPS Tool Chain. 


Remote File Access: Connects target with remote host file 
system allowing file transfer between target and host. 


Trace Facility: Traces the memory accesses of a user 
program. Provides for tracing the path of execution reads 
from and writes to memory. Trace qualifiers allow the 
tracing of a specific instruction or class of instructions. Also 
specific memory ranges may be specified. The user may 
stop tracing on the following conditions: Trace buffer full, 
hitting a breakpoint, executing a specific instruction or 
accessing a specific memory range. The trace buffer 
contents may be displayed using standard R3000 family 
mnemonics. _ 
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Virtual Address 
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The figure above shows the memory utilized by IDT/sim. EPROMspace, andis normally placed in 128KB of EPROM. 
The EPROM space starts at virtual address bfc00000, which IDT/sim uses main memory to store interrupt vectors, vari- 
is the R38000’s start-up address. The compiled version of ables, and astack. 71KB of RAM space is reserved for this 
IDT/sim with all features included occupies about 115KB of data. | 
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IDT/SIM COMMANDS 
asm <addr> Examine and change memory inter- 


actively using standard assembler mnemonics. 
brk/b [addresslist] Set/display Breakpoints. 
cacheflush/cf [-i/-d] 


callica <address> [arg1 arg2... arg8] Call Subroutine with 
up to 8 arguments. 


Flush |-cache and /or D-cache. 


checksum|/cs [start_addr num_bytes] Display the 


checksums for an address range. 


compare|/cp [-w/-b/-h] <RANGE> <destination> Compare 
the block of memory specified by RANGE to the block of 
memory that starts at destination. 


cont/c Continues execution of the client process from where 
itlast halted execution. 


dbgint/di [<-e/-d DEV>] Debug interrupt enable/disable - 
allows ‘break key’ to generate external interrupt. 


debug/db [DEV] Enter remote debug mode. 


dis <RANGE> 
RANGE. 


disptag/dt [-i] RANGE Displays the instruction or data 
cache tag values and data contents. 


Disassemble target memory specified by 


dr [reg#/name\reg ia PE the current contents of 
register(s). 


dt Dump the trace buffer. 


dump/d [-w/-h] <RANGE> Dump the memory peeceiee by 
RANGE to the display . 


enable DEVICE Connect to remote host for file access 


filllf [-w]-h]-b]-I]-1] <RANGE> [value_list] Fills memory 
specified by range with value_list. 


. ff [-s|-d] <reg#/name> <value> Fill <reg#name>with<value>. 
| go/g [-n] <address> Start execution at address <address>. 


gotill|gt <address> Continue execution until address <ad- 
dress>. 


help/? [commandlist] This command will print out a list of 
the commands available in the monitor. If a command list 
is supplied, only the syntax for the commands in the list is 
displayed. . 


history/h Display last 16 commands entered. 


- idb [DEVICE] Connect to remote host source level 
debugger.init/i_ \nitialize prom monitor (warm reset). 
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load/I [options] DEV Download code to target. 


move/m [-w]-b/-h] <RANGE> <destination> Move the block 
of memory specified by RANGE to the address specified by 
destination. 


next/n feount} Steo over subroutine calls. 
rad [-o/-d/-h] Set the default radix to the requested base. 
re [-i] [-w|-b/-h] <RANGE> |solate and read from cache. 


rdfile <filename> <RANGE> Read file from remote host file 
system. 


regsel/rs [-c/-h] Select either the compiler names or the 
hardware names for registers. | 


search/|sr [-w]-b|-h] <RANGE> <value> [mask] Search area 
of memory for value. — 


seg [-0/-1/-2/-u] Set the default segment to the requested k- 
segment. | 


setbaud/sb DEV Set the baud rate on a serial channel. 
step/s [count] Single step count times. 


sub [-w/-h/-b/-l/-r] <address> 
memory interactively. 


t {-a/-0/-e/-d/-r RANGE/-w RANGE/-c RANGE/-i INS/-m 
MSK} Trace command. 


tc [-e BPNUM] [-d BPNUM] Trace conditionally command. 


te [DEV] Connects the console port straight through to a 
second serial port. 


tex [RANGE] Exclude tracing calls to RANGE. 
tlbdump|td [RANGE] Dumps the contents of the TLB. 
tibflush|tf [RANGE] 


tlbmap/tm [-i index] [-ndgv] <vaddress> <paddress> 
Virtual to physical mapping of the TLB. 


tlbpid/ti [ pid] Set/display TLB PID. 


Examine and change 


Invalidates the contents of the TLB. 


tlbptov/tp <physaddr> Probe the TLB. 


ts [-b/-t/-0/-r RANGE/-w RANGE/-i INS/-m MSK] Stop trace 
comm and. 


unbrkjub | <bpnumlist> Clear breakpoints. 


we [-i] [-w/-b/-h] <RANGE> [value_ uae Isolate and write 
to | or D cache. 


wifile <filename> [value_list] Write file to remote host file 
system. 
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LIST OF RUN TIME SUPPORT ENTRY POINTS 


_exit install_normal_int ropen* 
atob ioctl rprintf* 
clear_cache longjmp rread* 
cli open rwrite* 
close printf set_mem-conf 
exc_utib_code putchar setimp 
flush_cache puts showcar 
get_mem_conf rclose* sprintf* 
get_range read* strcat 
getchar reinit strcmp 
gets reset strcpy 
install_command restart strlen 
install_ immediate_int rfileinit® tokenize 
install_ new_dev rgets* write 
rlseek* 


* New in Version 4.0 
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ORDERING INFORMATION 


To upgrade an IDT board level product, see the EPROM order codes below. To order IDT/sim in source code, order 
the Internal Use License AND order the software on the appropriate source media. You may also order binary distribution 
rights for the run-time version of the monitor. Ask your IDT sales office for information. 


Licenses 

Internal SOUPCO: WICC SO is siiccccescicsedecececevaesvavenscevasensustateusacssavaiaedivadseassiguenceusuraatadencacsssvisnsnccovsoeite 7RS901SLV 
Permits purchase of up to six copies of source code (any media combination) and use of source code to 
develop run-time binaries on up to six machines at a time, but does not permit inclusion of the run time code 
in an end product. Also purchase one or more of the Source Media listed below. 

Limited Binary Distribution Right. .............ccssssssssssssnssssesescnesessensecsessensnenneceseaesceeeeeeesneesseeseneeens 7RS901BLP-L 
Extension to Internal Source License to permit inclusion of bing code into end product. Internal Source — 
License must be referenced on order or ordered simultaneously. This license permits up to 100 copies tobe 
distributed royalty-free. For additional copies purchase the Unlimited Binary Distribution Rights . | 

UnLimited Binary Distribution Rights ..............:scssscressssssssssceceeseessnenensorssenseueneeesesesensenenansonsnees 7RS901BLU-L 
Extension to Limited Binary Distribution Rights to allow unlimited distribution of binary code. Internal Source 
License and Limited Binary Distribution Rights must be referenced on order or ordered simultaneously. 

Main tenaniGe Agreement os sscesasscvassestescnsensacsccssaveledievcuestasvcseicssesusesdsawauecvencssineoduncccavtervedtenscncsiecevexs 7RS901SSY 
One year free updates. We supply a direct telephone contact for support. 


Source Media 

IDT/sim source code can be compiled with either the MIPS C compiler or with IDT/c version 4.1 or later. Earlier versions of 
IDT/c cannot compile this code. The products listed below are media only and must be purchased with license 7RS901SLV 
above. 


Source for 386/486 PC, MS-DOS. .........csscccsesescesseesvenscsscsssncnsessenssussecssssssccssessecsteccanseesensensunesess 7RS901SBF-L 
Compile with IDT/c C-Compiler. Shipped with both 1.2 MB 5.25" and 1.44 MB 3.5" diskettes. 

Source for 386/486 PC, SCO Xemix .ss..ccssacerssrssscsesscnnscesccssssscncesenssesesnsnenssesevessccscccnsasesssees ...7RS901SXX-L 
Use with IDT/c C-Compiler. Shipped with both 1.2 MB 5.25" and 1.44 MB 3.5" diskettes. 

Source for IDT MacStation, ON Mac DiSC.............scsssssesesssssseaesesesesenesesssseesaeseeessennseesensaesenes 7RS901SMD-L 
Use with MIPS C Compiler supplied with MacStation or with IDT/c. | 

Source for MIPS / SUN machines, on DC6150 QIC TAR Tape ..........ccccssssccessseceeenseeseesenensnseees 7RS901SUU-L 


Use with MIPS C Compiler or with IDT/c. 


EPROM Versions 

The following versions of IDT/sim are supplied in EPROMs for the indicated hardware. These versions are for updating the 
hardware to the latest version of the monitor. No license feguiteds An upgrade is NOT available for the 7RS382 and 7RS383 
Evaluation Boards. 


For Any 7RS30x, 7RS40x Prototyping System ...........::ccccsseccseceeseseeersssseeseeeseneesesssesneceensesssenaeess 7RS901BAP 
For 7RS388 Real8™ Laser Printer Controller ...........ccccsscccccssesseccececcccnnecsccceecescccsceccsessccscensneereeess 7RS901BFP 
For 7RS385 Evaluation Board ..........ccscccsssccsssscccssscnssscccssscnsccccssscecssecesscceceracssesnncesanececeasenssscnessoess 7RS901BGP 
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Integrated Device Technology, Inc. 


IDT7RS903 


Multi-Host C-Compiler System 








FEATURES: 
¢ ANSI C-compiler, Optimizing Scheduler, Assem- 
bler, Linker, Librarian, and ANSI Libraries 


¢ Efficient Floating Point Emulation Mode for 
systems without hardware FPU. Includes 
Transcendentals 


¢ Symbolic and assembly level debug support. 


¢ Versions available for 80386 machines under 
MS-DOS™, MIPS machines and MacStation 
under RISC/os, and Sun SparcStation 


¢ Provides control over multiple memory seg- 
ments 


¢ Supports entire IDT family of MIPS ISA Proces- 
sors (R3000, R3001, R3051/2, R3081, and R3500) 


Host Development 
Systems 
MIPS boxes 
SUN SPARC , 386 PC 
and MacStation 


ea Gib 


Assembly Code. 


"” Exception 
‘Handling |.” 
|” FP Exception” 
* Handling ae 


adl / bdl 


704-00903-002/B 





OPTIMIZING C-COMPILER SYSTEM: 

IDT/c is C compiler system for the MIPS R3000 and 
derivatives, specifically designed for developing and debug- 
ging code that runs on aremote target. The compiler system 
includes the GNU C compiler, an assembler with instruction 
optimization, a linker, a librarian, and a symbolic debugger. 
A complete floating point emulation library is included also. 

The IDT/c package is available for execution on 386 
machines under MS-DOS, on the MIPS and SUN worksta- 
tions, and on IDT's MacStation. 

Unique features of IDT/c include the ability to divide code 
into segments for programming ROMs, the ability to relocate 
initialized variables into ROM space, extremely efficient 
floating point emulation (up to ten times faster than other 
emulation methods), and a programming language in the 
symbolic debugger to control execution by testing data and 
addresses in the program. 

New features in release 4.1 of IDT/c include the symbolic 
debug facility, a switch to produce assembly language 
output for the MIPS tool chain, and a full set of transcenden- 
tal arithmetic functions. 


The IDT Tool Chain 


Executable 
Code 
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IDT7RS903 is 
IDT/c R3000 C-COMPILER 


OVERVIEW : 

The IDT/c C-Compiler System is a 
complete development package for 
CPUs based.on the R3000 architec- 
ture. It contains an optimizing cross 
compiler, optimizing scheduler, as- 
sembler, linker, anda downloader. The 
‘C’ compiler is compliant with ANSI ‘C’ 
standard and performs the optimiza- 


tions available in state-o-the-art ‘C’ - 


compilers. The assembler supports 
the R3000 machine instructions and 
architecture described in the book by 


Gerry Kane, “MIPS RISC Architecture”, 


including both native and synthetic in- 
structions. The complete IDT/c pack- 
age runs on a variety of host machines 
and operating systems and is compat- 
ible with other IDT development soft- 
ware, such as IDT/sim and IDT/kit. 


Compiler 

The C pre-processor is GNU cpp 
and the compiler itself is based on GNU 
C. All C-preprocessing features are 
supported. The combination of the 
compiler and assembler included in 
IDT/c has been tested for compliance 
to the ANSI C standard using the Plum 
Hall test suite. The C compiler per- 
forms extensive optimization in mul- 
tiple passes through the code. Switches 
can be used to fine tune the optimiza- 
tions. 

The output of the compiler is an 
assembly language file. For the ver- 
sion of IDT/c that runs on the MIPS 
_ workstation, a switch selects whether 
output is for the IDT assembler or for 
_ the MIPS assembler. If the MIPS as- 
- sembler output is chosen, then mod- 
ules compiled by IDT/c can be as- 
sembled and linked in the MIPS envi- 
ronment with modules compiled by the 
MIPS compiler. IDT/c is far more effi- 
cient than MIPS c for floating point 
emulation. 
switch makes it possible to use IDT/c 
only for modules with. floating point 
code, and MIPS c for everything else. 


Optimizing Scheduler and 
Assembler | 

The IDT assembler implements the 
R3000 native instruction set as well as 
the augmented synthetic instructions 


The MIPS compatibility: 


GNU Compiler 


instruction 
Scheduler / Optimizer 


fromother | 
assemblile 


Object Code 
.0 





IDB data Absolute Sr 
ymb. debug Listing 


PreProcessor] | 
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_ Librarian 


a) 
<= 


emory Map 


PROMC 
EPROM ~ 
formatter 


modified 
S-record 7 | 
file 


to Download 4d. 


IDT/c Flow 


defined in the “MIPS RISC ARCHI- 
TECTURE” book by Gerry Kane. An 
optimizing scheduler first expands the 
synthetic instructions into the native 
instruction set. It then rearranges code 
to take advantage of R3000 pipeline 
architecture. The scheduler also ana- 
lyzes loads of static constants and tries 
to make use of previously loaded con- 
stants. The assembler produces. .o files 
which can be linked together with other 


files to produce an executable file. 





Memory Description File | 

The memory description file is used 
to instruct the linker where to place 
‘object modules in the R3000 memory 
map. It tells the linker what address 
classes are legal, what addresses exist 
within those classes, and what ad- 
dresses should be written to output 
files. The file consists of a sequence of 
class specifications (CODE, DATA, 
etc.) and associated address ranges. 
It is possible to control placement of 
individual modules. 
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Linker 

The linker combines separately assembled program files 
into one object module. Command line switches may be 
used to override the memory description file. 

There are three types of output file formats supported: S- 
Records, Intel hex, and binary image. The S-Record files 
are useful in down-loading to target boards. The hex format 
file is useful for EPROM programming because the code can 
be divided into multiple files under this format. S-Records 
can be downloaded to a target containing the IDT/sim 
monitor using a supplied download utility (DOS) or uucp 
(UNIX). Additionally, the IDB facilities in IDT/sim v 4.0 and 
IDT/c v 4.1 provide fast, reliable download of S-Records. 


Endianess 
IDT/c includes a switch so that code may be compiled in 
either Big-endian or Little-endian format. 


Floating Point Library 

IDT/c includes a floating point emulation library. A switch 
in the compiler is set at compile time to determine how the 
compiler should handle floating point instructions. In the 
normal mode, it will produce R3010 Floating Point Accelera- 
tor instructions in the object code. If the switch is set the 


TYPICAL COMPILATION SEQUENCE 


other way, the compiler will insert calls to the floating point 
library instead, and the floating point library must be avail- 
able at link time. Because the compiler knows about the 
library during compile time, it can perform optimizations not 
otherwise possible and minimize the execution penalty for 
using software instead of hardware. 


Librarian 

IDT/c supports object code library files. Many compiled 
routines may stored in a single library file by using the 
Librarian utility. At link time, the linker extracts only the 
routines actually used. This technique reduces the number 
of files that must be dealt with explicitly during program 
development. 


PROM-C 

PROM-C is a utility included with IDT/c that permits 
variables initialized by the program to be moved from their 
normal locations in the object code into designated memory 
space destined for EPROM. The user program can execute 
a simple routine at start-up to move the variables from 
EPROM back into RAM space at the appropriate locations. 


Assume 4 C modules, m0.c m1.c m2.c m3.c . m0.c contains ‘main’ function. 


Compile the programs: 
idtcl -oO 


=G =ZA Mm0.c mice m2.c¢- m3 .c 


(the default compiler mode determined at installation time is used) 


Make a library: 


1lib -c Libya Mm1l.6o Mm2<O M325 


Link it all : 


idtcl -o prog -ZT80020000 -ZD80030000 m0.o lib.a 


IDB - SYMBOLIC REMOTE DEBUGGER 


Remote debugging differs from the ‘conventional’ in sev- 
eral ways. The control of target program relies on commu- 
nication line and debugging agent on the target instead of 
using 0/s signals and related services. 

Idb was specifically designed to work with IDT/sim to 
provide full control of the target program. The distinguishing 
features are: a program mode in which idb executes scripts 
that contain debugging and flow control commands; and 
host file services which provide target program with full 
access to host files. The required physical link between host 
and target is a single RS232 line capable of 19200 bps. 





It is also possible to use direct low-level IDT/sim debug- 
ging from an idb session. is 


Program mode 
The script sample on page 6 (last page) illustrates some 
of capabilities of the program mode. 


Remote file services 
IDB supports file open, close, read, write, seek; printf; and 
gets commands in the standard C library format. 
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FLOATING POINT EMULATION MODE 


When floating point and double length variables are used 
inC programs, compilers usually produce assembly instruc- 
tions that directly operate on floating point arguments. Most 
of the time this also requires use of designated register set 
to hold floating point operands. The C compiler is aware of 
underlying hardware and attempts to produce optimal code 
by using all available resources. 

The R3000 architecture has the floating point coproces- 
sor in separate chip (R3010). There are numerous floating 
point instructions that operate on the 32 floating point 
registers inside the R3010. When floating point hardware is 
not present — for example, an R3000 without the R3010, or 
an R3051 or R3052 controller chip — executing programs 
that use floating point arithmetic requires software that can 
compensate for the missing hardware. There are two basic 
solutions to this problem: trapping on the floating point 
instructions at execution time, and simulating the FPU in 
software in the trap handler; and modifying the compiler so 
that it does not produce any instructions for the FPU in its 
Output. 

Using the operating environment to trap all attempts to 
execute instructions on (non-existent) floating point hard- 
ware offers the advantage of using a single object code 
version of the application whether hardware FPU is present 
or not. The disadvantage is that the complete FPU state 
machine must be emulated to the last detail since the code 
produced by the C compiler expects the real hardware to be 
present. The code that must be executed for each FP 
instruction is substantial: the trapping overhead for each FP 
instruction, the maintenance of the FP state machine, and 
the instructions to execute the required FP operation on 
integer hardware. 

For example, the sequence below requires 3 traps, each 
of which involves saving all the registers used in the particu- 
lar trap routine, maintaining the state of the ‘virtual’ FP 
register set somewhere in memory, performing the actual 
FP arithmetic (double addition), and updating the ‘virtual’ 
FPU status register bits. 


lwel S£14, ($9) 
lwel Sf1i5., 4($9) 
add.d $f8, S$f14, Sf6 


The solution implemented in IDT/c is to switch the com- 
piler to a different mode for the two environments. In emu- 
lation mode, the compiler does not assume presence of any 
additional hardware, so only R3000 instructions are pro- 
duced, and FP operations are performed by calls to special 
routines. The calls are compiler-generated and there is 
absolutely no difference on the C source level. The same C 
program that generated the example above would generate 
the following code in IDT/c emulation mode. 


lw | Sar too) 
lw $5, 4(S9).. 
jal __adddf3 
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The only overhead is that of performing FP operations on 
R3000. On floating point intensive applications, IDT/c typi- 
cally yields execution times four to five times slower than 
R3010 execution times, but eight to twelve time faster than 
the FP hardware emulation method described above. 


IDT/C PERFORMANCE 


The performance of IDT/c has been measured against 
the MIPS C compiler, which is a well-respected tool chain 
offering the highest performance on RISC machines. 


Floating Point Emulation 

Table 1 shows the relative performance of a floating point 
intensive program under three different floating point op- 
tions. The hardware in all three cases was the the 7RS388 
Laser Printer controller board. This board uses the R3000, 
the R3010, and 16KB each of Instruction and Data cache. 
The board was operating at 25MHz. 


Seo Trap Emulation | IDT/c een 





Table 1. IDT/c FP Emulation Performance 


The column labeled “Hardware” shows the results of aMIPS 
C compilation using hardware FPA instructions. | 
The column labeled “Trap Emulation” shows the same 
binary run in the same system with no hardware FPA. The 
operating system traps on the R3010 instructions, and the 
trap handler emulates the R3010 hardware in software. 
The last column shows the results for IDT/c Emulation 
mode. In this test, the code was re-compiled with IDT/c in 
FP Emulation Mode. Whenever a floating point operation is 
required, the compiler generates a call to the appropriate 
library routine to perform the function. There is no trap 
overhead. 


e 
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Integer Comparisons 

Table 2 illustrates the results of a set of benchmarks 
compiled by both the IDT/c and MIPS “C” compilers. These 
benchmarks are commonly referred to as “The Intel Bench- 
mark Suite”, since Intel introduced them to measure the 
performance of various embedded processors began when 
they announced the i960CA. 


MIPS C 


Benchmark 
Anneal 
BubbleSort 


0 


Dhrystone 
MatMult 
P|-500 
QuickSort 


Table 2. IDT/c vs. MIPS C on Intel Benchmarks 


Table 3 may be more representative of the range of 
differences, as the Stanford Benchmark suite tends to 
exercise more of the processor. 


Benchmark MIPS C 
Perm 
Towers 


Queens 


IntMatMult 
Puzzle 

| QuickSort 
BubbleSort 


Table 3. IDT/c vs. MIPS C on Stanford Benchmarks 


The results indicate a variety of performance differences 
between IDT/c and MIPS “C” across these benchmarks. 
Note, however, that these benchmarks may not be fully 
representative of either compiler, as they are extremely 
small programs using only integer arithmetic. 

Note that the performance difference between these 
compilers is different across different hardware platforms. 
Specifically, the ability of the benchmark to remain cache 
resident will influence the performance gain of MIPS tech- 
niques such as procedure inlining and loop unrolling. Sys- 
tems with differing cache sizes and/or memory latency may 
then show different results for integer code. 


Mix and Match Strategy 

To maximize performance, a system designer could 
choose to use a “mix and match” strategy in the software 
toolchain. For example, the bulk of the application could be 
compiled using the MIPS compiler, while IDT/c is utilized in 
the floating point intensive portions of the code. 

This approach marries the best of both toolchains. The 
MIPS compiler extracts maximum performance from the 
majority of the integer only code, while IDT/c does the best 
job of performing floating point operations in software. 
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IDT/c facilitates this approach by allowing IDT/c to use 
the MIPS backend assembler, thus allowing code gener- 
ated by IDT/c to be directly linked with code generated by the 
MIPS compiler. Thus, the programmer can use IDT/c (with 
the MIPS backend assembler) on the floating point intensive 
code, and the MIPS compiler on the rest of the code. 

Table 4 illustrates the performance gain achievable when 
using such a mix and match strategy. Inthis table, two of the 
Stanford Benchmarks are shown with an IDT/c only, and 
with a mix and match strategy. 


Benchmark IDT/c Mix and Match | 





Table 4. IDT/c with IDT/c back-end vs. MIPS back-end 
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1>bp fool 
--2>bp £oo2 

10>c_ "  ~3 | continue 

-20>if $cb .eq. 1 then 50 if stopped in fool just 

_ | _  vecord the stack pointer 


| | value 
30>if $cb .eq. 2 then 60 if in foo2 goto 60 
40>stop _ | | stopped somewhere else (not 
| in fool or foo2) 

50>Sstack = $29 

55>goto 10 

60>if $29 .gt. S$stack then 10 continue 

70>ub 1 stack is too low. remove 

755ub: 2° bp 1 and bp 2. 

80>bp int_handler set breakpoint in 
int_handler 

90>c continue 

100>if S$cb .-ne. 1 then 40 not in int_handler 

110>if $29 .gt. 0xa0060000 then 90 

not dangerously low 

120>if $4 .gt. 10 then 200 int_handler should never be 
called with the first arg 
greater then 10 

130>stop 

200>ar display argument 

210>stop . 


Sample Program in IDB Control Language 
This example monitors stack depth difference in two 
routines and if positive starts to trace stack values in the 
third routine. 





ORDERING INFORMATION 


The IDT/c C-Compiler is an efficient R3000 C-compiler system based on the popular GNU C and hosted on a variety 
of computers. The IDT/c system includes the compiler, assembler, scheduler and linker. All PC versions of the software 


_. are shipped with both 1.2 MB floppy discs and 1.44MB 3.5" diskettes. A "boxtop" single user license is included with the 


product. Contact your IDT sales office for multiple user HeeNSING: 


Media, with Floating Point Library 
The software listed below includes the floating point library. 


For 386/486 machine, MS-DOG..................s:ccsssscscccssssssesscssncnssenessccecccecserenceessscsseseeseeeess 7RS903FBBF-N 
This product uses extended memory space on the 386. 4 MB recommended. 7 

For 386/486 machine, SCO Xemix ..........scsccsssseecessssscesssceesesssceessserseesaeeecerenneeessssenenesens 7RS903FBXX-N 

For MIPS machine RISC/os, on DC6150 QIC TAR Tape ..........ccssscserssesecesserssceseneees 7RS903FBUU-N 

For MacStation, on Macintosh DISC ............ccsssssssescseescsesesseesesssssosansssseeseeeeessseneeeseses 7RS903FBMD-N 
Runs on MacStation R3000 board under IDT/ux. 

For SUN Sparcstation, on DC6150 QIC TAR tape ............::csssssssecesesssssecessessssssssensnes 7RS903FBWU-N | 
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IDT/kit™ IDT7RS909 
KERNEL INTEGRATION TOOLKIT 


Integrated Device Technology, Inc. 


FEATURES: 


¢ Source code and object code versions of com- 





ESSENTIAL CODE FOR R30XX SYSTEMS 
IDT/kit (Kernal Integration Toolkit) consists of libraries 
and routines for important system software operations for 


monly used routines for an R3000 ISA CPU R3000-based CPUs. Modules are provided for initializing . 
¢ Start-Up Code to initialize CPU, MMU, and C systems, handling interrupts, servicing floating point excep- 


runtime environment 


tions, and many other other common operations. Libraries 
are included for floating point emulation, transcendental 


¢« Cache control code to size, initialize, flush, and arithmetic routines, and ANSI standard C functions. All IDT/ 


clear for DMA 


kit libraries are supplied in source code (C and assembly) 
and in object modules compiled for both little- and big- 


- Re-entrant Exception Handler endian systems and for both hardware and software emula- 
- Floating Point Emulation Library and Transcen- __ ton floating point. . 


dental Math Functions 


IDT’s MicroMonitor is also included in the IDT/kit package. 
The MicroMonitor is a very simple monitor for initial debug 


¢ ANSI Standard C Library of new hardware. It requires only that the CPU, EPROM, 
Tine Suaport Functions and a serial port be operational. The MicroMonitor can be 
PP an invaluable aid for detecting state machine problems in 


¢ MicroMonitor for initial hardware debug first article hardware. 


¢ Interface Library to IDT/sim™ monitor 





IDT/sim 
Debug 
Monitor 


IDT/kit 


Application Code 


High Level | FPU exception | Transcendental 
Exception Handler handler Math Library ee clib | Start Up 
General Exception Floating Point control | Library | Code 
Handler Emulation 
(optional) 


R3000 ISA CPU 








Schematic Representation of the modules in IDT/kit, showing how they control 
parts of the R3000 CPU and connect to IDT/sim, IDT’s debug monitor. 


The IDT logo is a registered trademark and IDT/c, IDT/kit and IDT/sim are trademarks of Integrated Device Technology, Inc. 
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IDT/KIT“ FEATURES 

The IDT Kernel Integration Toolkit (IDT/kit) consists of a 
set of libraries ready to be linked with user developed code. 
IDT/kit contains functions that would normally be furnished 


by an operating system like UNIX but without the overhead. 


Functions are provided for initializing the system, memory 
management, exception handling and time support; an 
ANSI standard ‘C’ library and a math library with transcen- 
dental functions are supplied. 

This environment can be compiled with IDT/c or MIPS 
compilers, Big or Little Endian, Cached or Uncached and 
with an optional Emulation Mode if no Floating Point Accel- 
erator is installed. With IDT/kit, floating point emulation 
support is transparent to the user application. 

IDT/kit typically would co-exist with IDT/sim and become 
part of a total development and debug environment. On a 
system where IDT/sim is installed, all the commands, entry 
points and debugging facilities of IDT/sim are available to 
the Kernel Integration Toolkit. When using IDT/sim, IDT/kit 
filters exceptions first. If IDT/kit does not handle an excep- 
tion, then it is passed to IDT/sim. 

_ Default exception handlers intercept exceptions, save 
the environment, preserve the Exception Registers for later 
analysis, restore the environment and return to continue 
program execution. The default handlers can easily be 
replaced or extended with more robust handlers written by 
the user. 

- -IDT/kit relieves the application from the low level tweak- 
ing necessary to get started but leaves easy hooks into the 
system for expansion and polishing as the development 
progresses. All code is supplied in source code (C and 
assembly), to allow easy access for modifications needed to 
tailor the system to specific needs. This allows the program- 
mer to shorten the project development time by the 2 to 3 
months required to understand and service the R3000 ISA 
resources like cache, MMU and exception handling. 


IDT/kit includes IDT’s MicroMonitor, a very simple moni- 


tor for performing the initial debugging of new hardware. 
The only hardware which must be functioning to run the 


MicroMonitor is the CPU, EPROM and a serial port function. 


This allows for immediate debugging of hardware even 
when the DRAM memory is not functioning. 


IDT/KIT COMPONENTS 
1. MICROMONITOR: 


A small assembly language monitor to aid in debugging 


the hardware design. The MicroMonitor requires only that 
three hardware resources are functional: the CPU can 
execute instructions, the EPROM can provide instructions 
and the UART can send and receive characters. 
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2. IDT_CSU.S, THE START UP MODULE | | 
Supplies the initialization and set up. code necessary for — 
operation of the system. 


_¢ initialize the Status Register — 
a) clear parity error bit 
b) set Coprocessor 1 usable bit correctly 
c) clear all IntWasks enabled 
d) set kernel/user mode 
* set Cause Register 
a) clear software Interrupts Pending 
* clear bss area 
establish temporary uncached user stack 
* determine memory and cache sizes 
* establish permanent stack at Top of mone 
¢ flush | and D-Caches 
if there is a Translation Lookaside Buffer javalidat it 
-jnitialize library if IDT’s standard C Library is used 
initialize exception handlers 
jmp to users’ main() 


3. LIBIKIL.A, KERNEL INTEGRATION LIBRARY: 

The IDT Kernel Integration Library (libikil.a) is a library 
which can be linked to user programs to supply functions 
required to support the environment of the R3000 ISA 
family. This library is divided into four sections: Memory 
Handling, General Exception Handling, Floating Point Ex- 
ception Handling and Time Support Functions. 

a. memory handling - the full range of functions neces- 
sary to manage main memory, cached memory and 
the Translate Lookaside Buffers 

b. general exception handling - the functions used in 
enabling and handling any interrupt exceptions, 
hardware or software, asserted by the CPU 

c. floating point exception handling - provides the 
support for the Floating Point Unit, the R3010, or for 

- Emulation Software for floating point arithmetic , 
“strongly recommended” by the IEEE Standard 754- 
1985. It is transparent to the application code calling 
this interface whether hardware or software em ula- 
tion is being employed. 

d. Time support functions using the 8254 timer asa 
prototype | 


e 
e 
e 
e 


4. LIBILNK.A, THE IDT/SIM INTERFACE LIBRARY: 


the IDT/sim linking module which interfaces with all the 


functions available with IDT/sim not defined with IDT/kit or 
‘by the user 


5. LIBIC.A, ANSI STANDARD C LIBRARY: 

The IDT Standard C Library (libic.a) is a standard archive 
library which, when linked with the users’ filename.o files, 
provides the functions defined by the ANSI standard includ- 
ing standard |/O, String and Character functions, Utility 
functions, and memory allocation functions. 
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6. LIBIMATH.A, MATHEMATICS LIBRARY: 

The IDT Math Library (libimath.a) is a standard archive 
library which, when linked with the users’ filename.o files, 
provides the transcendental functions required for standard 
math processing. Whether hardware floating point or soft- 
ware emulation is be used is transparent to the application 
code calling this library. 


7. UTILITIES: 

Three utilities that execute under MIPS RISC/os are 
supplied to convert from compiler output from the MIPS coff 
file format to an S-record format and to downloading the S- 
records to a target board in either ASCII or binary forms. 
These utilities are need only for users of the MIPS C- 
compiler who do not have IDT/sim or MIPS SPP/e. 


HOW IDT/KIT IS USED 

IDT's Kernel Integration Toolkit includes a robust set of 
tools for the embedded controller developer. They are 
“oackaged” in accessible, modular containers, the IDT/kit 
libraries. The four libraries, arranged by function, supply 
most of the routines required by RISC applications. Only 
those libraries needed to resolve function calls must be 
entered on the link command line; the others are never 
accessed which establishes a fully modular environment. 

IDT/kit serves as an envelope for the installation’s appli- 
cation code. The source module, idt_csu.S, is linked first, 
then the development code and, finally, any kit libraries 
required to support function calls. This allows the developer 
to concentrate on the application and not waste resources 
re-developing the support routines. Although the libraries 
are provided complete and ready to link, source code for all 


the functions is also distributed to allow easy examination 
for information or as a template for additional routines. All 
the necessary Make/Batch files are included to facilitate any 
changes, additions or corrections. Some examples: 


strcpy (or any of the C library routines) - perhaps your 
installation has developed a super-algorithm. It isn’t 
quite ANS! Standard but does your job better. 
Simply edit the source in the clib subdirectory (or 
replace the one that is there), execute the makefile 
for your configuration (IDT/c or MIPS, Big or Little 
Endian, REAL or Emulation Mode) and the library, 
libic.a, now contains your code. Your module could 
also be placed on the system Link line before the 
corresponding library and the call would be resolved 
before the library is searched. 


Interrupt Handling - you want the default interrupt 
handler (it’s already there), but you need an addi- 
tional flag set. Edit the routine in the killib 
subdirectory, run the Make/batch file for your 
configuration (IDT/c or MIPS, Big or Little Endian, 
REAL or Emulation Mode) and the library, libikil.a, 
now contains “your “default interrupt handler. 


These kinds of simple modifications allow the IDT/kit 
routines to be tailored for a specific system without expend- 
ing the time to investigate and understand all the routines; 
only the section applicable to the application need be 
tweaked and the Make/Batch files provide easy guidelines 
for doing it. 
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IDT/kit Kernel Integration Toolkit 
IDT/kit FUNCTION LIST config_Dcache() ----- size of Data cache 
config__!cache() --------size Instruction cache 
CSU_IDT.S: START UP MODULE flush_Dcache() ------- invalidates entire Data cache 
Start() -------------------- Startup routine flush_Icache() -------- invalidates entire Inst cache 
LIBIKIL.A: | KERNEL INTEGRATION LIBRARY Sh a rl 
Programmable interval timer driver Assembly language TLB access 
install_timer_driver() install timer driver resettlb() ---------=------ invalidates tlb entry 
IB254init() -------------- timer driver init ret_tiblo() --------------- returns tlb entry lo reg 
I8254open() ----------- opens the device ret_tlbhi() --------------- returns tlb entry hi reg 
I8254ioctI() ------------- i/o control function ret_tlbpid() -------------returns tlb process ID field 
Assembly level exception handling - Set_tlbpid() ------------- sets current tlb pid 
disable_int() ----------- clear selected interrupts tlbprobe() --------------- probes tlb 
enable _int()--------- --- set selected interrupts tlbmapping() ----------- maps tlb entry 
exc_norm_code() ----general exception code Time support module 
exc_utlb_code() ------ UTLB Miss code | time_cmd_init() ------ starts clock 
exception() ------------- general exception code time_init() -------------- init timer drvr 
init_erc_vecs() -------- init vector code — timer_int ---------------- clock interrupt routine 
longjmp() EAA aga og go to setimp() point time() Soeesh Sou eo ees returns timer tics 
other_excp() Mair «aR eae Se handles other exception time it() en enn anne ne enneee times the selected function 
setjmp() ------------ ----- set setimp() state eS 
: : : Assm language Write Buff Routine 
High level exception handling wbflush() --------------- flushes the write buffer 
add_ext_int_func() --set default exc handler . 
clr_except_ptr() ------ clears setjmp pointer LIBILNK: IDT/SIM LINK LIBRARY 
config_memory() ----- size of main memory : 
extern_int() ------------ external interrupt code Cache Routines 
- exception() ------------- general exception code clear_cache() --------- clears portion of | and D 
get_except_ptr() ----- get execption pointer flush_cache() --------- flushes entire | and D 
init_tlb() ----------------- initializes TLB Character Routines . 
mem_exc_hdir() ------ memory exception code getchar() ---------------- inputs a character 
sae_errmsg() ---------- prints msg & exits putchar() ---------------- outputs a character 
set_except_ptr() ------ sets setjmp pointer showchar() ------------- makes character visible 
spurious_int() --------- unexpected ext interrupt | 
; Command Line Interpreter 
FPU interface module Cli() ----------------------- Command Line Interpreter 
fp_defaultHdlr() ------- default handler get_range() ------------ parses the range spec 
fp_disableTrap() ----- clears trap bits in fpcsr tokenize() sueeeresen seer parses the command line 
fp_enableTrap() ------ sets trap bits in fpcsr 
fp_init() ------------------ init floating Point Exit and Reenter Routines 
fp_int() ------------------ FP interrupt dispatcher _OXIt() oren-ne nnn nner eee exit & return to monitor 
fp_signal() ------------- user exception handler promexit() -------------- exit & return to monitor 
fpclr_stickybits() ------ clear sticky bits in fpcsr feinit() ------------=------ reinitializes monitor 
fpget_excregs() ------ get Exc Regs reset() ------------------- resets prom monitor 
fpget_fpcsr() ----------- get FP Control/Status restart() ----------------- restarts the debug monitor 
fpget_RM() ------------- get rounding mode fpcsr . 
fpget_stickyBits() ----get sticky bits fpcsr hi Screen oMiine ts Helo S 
fpset_fpcsr() ----------- set FP Control/Status elp() PIS e pee 
fpset_RM() ------------- sets rounding mode ROUTINES TO EXTEND IDT/SIM 
fpset_stickyBits() ----sets sticky bits in fpcsr 
fpset_excregs() ------- set Exc Register buffer install_commands() -adds user commands 
Assembly language FPU access install_immediate_intnstalls user interrupt 
clr CAUSE() ---------- clears SW bits in CAUSE Sten ew ey listal= new device 
get_CAUSE() --------- ‘atirne-contentcot CAUSE install_normal_int() --installs user interrupt 
get_fpcsr() ------------- returns FPU csr Routines for low level I/O 
get_cpOepc() ---------- gets epc ClOS@() ----==----=-22--2" closes an open device 
get_STATUS() ala status register contents open() ------------2------ opens a device 
set_CAUSE() --------- sets CAUSE Register read () -----------2ee2=--"- reads data from device 
set_fpcsr() ------------- sets FPU csr Write() ------------------- writes data to an device 
Functions affecting I/D Caches /O Control Function 
clear_Dcache() ------- invalidate portion of Deache j0ctl() --------------+----- sets I/O flags / calls drivers 
clear_Icache() -------- invalidate portion of Icache 
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IDT/kit Kernel Integration Toolkit . PRODUCT BRIEF 
Routines to save /restore context SUISPN() ----<¢0r-eee-n-= len of prefix of str 
longjmp() --------------- restores setjmp context StrcSpN() ---------------- len of prefix of str 
setjmp() ----------------- saves the current context strpbrk() ---------------- ptr to first occur of any char 
. . ; Strstr() ------------------- string a occurs in string b? 
Memory configuration routines Strtok ()secesesepeeiece-ce return tokens 
get_mem_conf() ----- returns mem configuration strtod() ------------------ convert string to a double 
set_mem_conf() ------ sets the mem configuration strtol() ------------------- convert string to long int 
Formatting printroutine — LIBIMATH.A: TRANSCENDENTAL MATH LIBRARY 
Pili) eaissrnrecreees: formatting print routine acoSh() ------------------ inv hyperbolic cosine of x 
Dummy routines for libic BROS |) sora cost (x) 
_init_file() -------------- dummy file routine ASIN See sin -1(x) ee 
_init_sbrk() ------------- dummy sbrk routine ASIAN () eooeseseniennte= inverse hyperbolic sine of x 
ALAN() mneneenneneenne nen tan -1 (x) 
String routines AtaN2() --------e-----=-"- tan -1 (x/y) 
atob() -------------------- Ascii string convert atan2() ------------------ tan -1 (wy) 
Qets() -------------------- gets string function | atanh() ------------------ inv hyperbolic tangent of x 
DUtS() ----------2-nn eee outputs string to I/O CabS() ------------------- complex absolute value 
strcat() ------------------ concatenates two strings OXP() ---------n een ee exponential function e*x 
stromp() ----------------- compares two strings hypot() --------------=-+- sqrt (x*x + y*y) 
StICpy() ----------"------ copies one string to another Z_abS() ------------------ double-complex absolute 
strlen() ------------------ returns length of string Cbrt() ----------------29- cube root of x 
LIBIC.A: | ANSI STANDARD C LIBRARY ca RS i laehior renee ne 
AIDS | ji exh senteetsesend atten absolute value of integer XPM () -----s2-02-2--2"= exponent (x - 1) 
BLON() eseaceaseeorsieeetse2 fp value of an Ascii string C@iI() ---------------=-=-+- smallest int not < x, double 
ali) integer value of Ascii str floor() -------------------- largest int not > x, double 
BION() Ses setenceaeceres long value of Ascii str Fint() --------------------- nearest x in dir of round — 
bsearch() --------------- binary search of a array | fMOd() ------------------- fp remof x/y, sign of x 
CIV() cnnnn-nnnnn-ae2-2-=-- rem & quot of int division Po tan -1 (x) 
ferror() ------------------ error during a file operation? CGa() ete cos of x 
QIOXIL() aasesteattest ooo routines called at exit time @XP() ------nnnn-neee eee = exponential function ex 
OXIt() aes s serecsoneses Terminate with status |Og() ----2---2--22--22--2"" natural logarithm In(x), x>0 
fOPON() =--s++e-++hwe-n- open file/ret file stream ptr. log10() ------------------ base 10 logarithm, x>0 
fclose() ------------------ close a file A eee sine of x 
fdopen() ---------------- open stream SQIt() --------n-e---nnneee square root of x, x>= 0 
labs() -------------------- absolute value of long arg fanQ eee tan of x 
IV. () seeesererseesesscaes rem & quotient of division XtOi() --------------------- raises xX to integer power, i 
MOG ()e-Fattineeseseneennse free allocated memory sim_fpint() ------------- simulate IEEE standard trap 
Malloc() ----------------- memory allocation sim_unint --------------- sim FP Unimplemented Op 
realloc() ----------------- reallocation of memory lOg() --------------"""--"-- natural logarithm In(x), x>0 
ftscanf() ------------------ read data from a file l0g10() ------------------ base 10 logarithm, x>0 
printf() ------------------- display data on the std I/O fog 1 pt) ssescsasescetons log (1 +x) 
qsort() pe We a hose quick sort routine log L() esubeniatcacadas< og(1 + x) -2s/s 
rand() ------------------- -generates random number pow() en ee eee xAy 
srand() ------------------ seed for random num genr COS() ------------2--=====- cos of x 
sbrk() -------------------- mem allocation bp routine Bt sine of x 
Scanf() ------------------ read data from standard input A hyperbolic sine of x 
SPHIAU ():-te=sseseneeneese output data into a string copysign() -------------- returns x with sign of y 
sscanf() eee read data from a string drem() Site eee dee trees X- n*y, integer nearest n 
mememp() ------------- compare two memory arrays finite); 1 = real x; 0 = INF or NAN x 
memcpy() -------------- memory array Copy logb() -------------------- exponent of x4n 
memmove() ------------ memory array move scalb() ------------------- x * (2**n) computed for n 
memchr() --------------- ret ptr to first matched char Sih) <-seeeceeseeeneecs square root of x, x>= 0 
memset() --------------- place a char in memory array ian) sae tan of x 
Strlen() ------------------ return string length @XPM1 () -----------===== exponent (x - 1) 
StrCMP() --------------=-- strings are identical? tanh() -------------------- hyperbolic tangent of x 
StrCPy () ~-----ne---o-o--- copy string fabs() -------------------- absolute value of number 
strncpy() sia peat oe ata copy n characters of a string frexp() Soccuntteiwansscccs returns mantissa;exp in “ptr 
SUNCIIN() sn-sareeeseeecaer- ret ptr to first match of a char isnan() ------------------ tests for floating point NaN 
strchr() ------------------ ret ptr to last match of a char Idexp() -----------+---08- returns quantity *2%exp 
strcat() ------------------ concatenate a strings BOW) eeeessteece eee xAy 
strncat() ----------------- concatenate strings 
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ORDERING INFORMATION ee os i 

To order the IDT/kit Developer’s Package, order the Internal Source License AND orderthe software on 1 the anaroniiers 
source media. The License Agreement is available from your local IDT sales office and is also published in IDT’s 
development tools catalog. 


LICENSES 

Internal SOurCe LICCHSO scsi cccicscssiscccceasctiasinacecadustcaaessasessenicics densearvededddesapevacewsaiseuvvacuoeaanons 7RS909SLV 
Permits purchase of up to six copies of source code (any media combination) and use of source code 
to develop run-time binaries on up to six machines at a time, but does not permit inclusion of the run 
time. code in an end product. 

Limited Binary Distribution Rights ................csscsccsssssessecsssesesecessnessseesessetessneessasecsenssoes 7RSS909BLP-L 
Extension to Internal Source License to permit inclusion of binary code into end product. Source 
License must be referenced on order or ordered simultaneously. This license permits up to 100 copies 

| to be distributed royalty-free. Additional copies are subject to the royalty . 

UnLimited Binary Distribution Rights ................csssssssssssscssesseessessesssssessersesarsessesanesaeens 7RS909BLU-L 
Extension to Internal Source License to permit distribution of unlimited number of copies of binaries 
derived from the code. 


MAINTENANCE | 
Maintenance Agreement ............cscssssesssseees inca basaiuay uluanawovadasocadencatiadiusacbinatsl jeadenahadenepesnes 7RS909SSY 
One year free updates. 


SOURCE MEDIA | 
IDT/kit source. code can be compiled with either the MIPS C compiler or with IDT/c version 3.5 or later. 
Earlier versions of IDT/c cannot compile this code. The products listed below are media only and 
must be purchased with license 7RS909SLV above. 


Source FOF 986, MS-DOS s22.c224 aataatiatssaanansiandessdentienasaintenated insaunniaasi 7RS909SBF-L 
Compile with IDT/c C-Compiler. Shipped with both 1.2 MB 5.25" and 1.44 MB 3.5" diskettes. 
Source for 386 PC SCO Mein scssescscsiccivstitcccascstivenedecansectecveievsndessdanatexbanea caesdeveadigentass 7RS909SXX-L 
Use with IDT/c C-Compiler. 

Source for IDT MacStation, On Mac DISC ..............::.sscssssscsssessesssscsessctsesseesssesesesnecseoaes 7RSS09SMD-L 
Use with MIPS C Compiler supplied with MacStation or with IDT/c. 

Source for MIPS or SPARC machine, DC6150 QIC TAR Tape ............sssscsseerseeees ieveests 7RS909SUU-L 


Use with MIPS C Compiler or with IDT/c. 
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MacStation™ 3 


IDT7RS503 


RISC WORKSTATION INA 


Macintosh® 


Integrated Device Technology, Inc. 





FEATURES: 
¢ 15 or 25 MIPS RISC Computer Add-In Board for 
Macintosh || Computers 


Includes AT&T UNIX® SVR3 Operating System 
with BSD 4.3, NFS, X11, and Motif 


Supplied with MIPS C-Compiler, Assembler, and 
Symbolic Debugger 


Uses all Macintosh peripherals for I/O 


Includes Macintosh-independent SCSI and 
serial I/O ports 


Multifinder and System 7 compatible 


Available as add-in board or as completely 
configured systems 


R3000 COMPUTER PLUGS INTO 


A MACINTOSH II 

IDT’s MacStation™ 3 is a high performance R3000- 
based workstation consisting of a MIPS® R3000 RISC CPU 
board that plugs into a Macintosh II computer, a complete 
AT&T SVR3 UNIX® operating system, and a collection of 
Macintosh programs used to communicate between the 
UNIX and Mac operating systems. 

The UNIX software is MIPS RISC/os v 4.5.2, MIPS port of 
AT&T Unix. It is supplied with TCP/IP, NFS®, X11G3, and 
Motif. The UNIX software can support multiple user ses- 
sions, using either Telnet or X, running one or more Macs or 
external terminals. 

A Macintosh application, IDT/console, is used as a con- 
sole terminal and I/O handler for the UNIX OS. Opening the 
application creates a terminal window on the Macintosh 
from which UNIX can be booted. File 1/0 commands from 
UNIX are intercepted by IDT/console, which reads and 
writes UNIX files encapsulated inside Macintosh files. The 
encapsulated files can reside on the same disks as Macin- 
tosh files without partitioning. With a single command, a file 
can be moved between the Macintosh and Unix environ- 
ments. An optionally available package, Intercon’s NFS/ 
share™, can be used to mount the Unix files on the Macin- 
tosh. 

Other Macintosh software includes IDT/envy™, an Ex- 
tension that provides TCP/IP paths among the Mac, UNIX, 
and an Ethernet card; NSCS Telnet, a telnet terminal appli- 
cation for the Macintosh; Consulair EDIT, a Macintosh text 
editing program; and MacTCP. 
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IDT7RS503. 
Salen 3 RISC WORKSTATION 


MACSTATION 3 HARDWARE 
RISC CPU Card 


The MacStation is available with either of two CPU cards. 


One is approximately 15 MIPS performance, and includes 
8MB of local DRAM, 2 serial I/O ports and a SCSI port. The 


processor on this card runs at 20MHz, and includes 16KB 


each of Instruction and Data Cache. The other card is 
approximately 25 MIPS in performance, and includes 16MB 
of local DRAM, 2 serial I/O ports, and a SCSI port. The 
‘processor runs at 25MHz, and includes 64KB each of 
Instruction and Data Cache. 

The SCSI port can be used to connect hard disks with the 
UNIX file systems on them. This provides higher perfor- 
mance than that available when the Mac’s disks are used for 
the UNIX files. | 

The serial I/O ports can be used for a separate console 
terminal or for printers or other common I/O devices. Drivers 
are included for a number of common devices. 





RISC CPU BOARD 


I-CACHE | D-CACHE 


~ MONITOR. AND 
_ NuBus INTERFACE 





Macintosh Il 


CONTROLLER 


Hard Disc 





PRODUCT BRIEF 

MACSTATION 3 SOFTWARE 
UNIXO/S 

The UNIX OS is AT&T SVR3 with BSD 4.3 extensions 
and fast file system. This is a full-featured UNIX, including 
on-line manuals, SMTP, reconfiguration files, and the MIPS 
C-compiler. Any application which runs under MIPS RISC/ 
os will run directly on this OS. Ports of other SVR3 applica- 
tions are easy, but generally require recompiling for the 
MIPS hardware. The UNIX software includes NFS file shar- 
ing services, the X11 library, and Motif (MIPS RISCwindows). 
The UNIX file systems are encapsulated in Macintosh files 
and may exist on any number of Mac disks. Swap space is 
preset to 40 MB, but can Pe changed easily. 
C Compiler 

The C compiler included with the MacStation i is tHiG MIPS 
C compiler, noted for extremely efficient optimization of 
code. A switch in the compiler determines whether K&R or 
ANSI C is supported. 


Hard Disc 
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Drivers 

Source code is supplied for a number of I/O devices, 
including terminal, disk, and network. A reconfiguration 
directory and make files are available to rebuild the kernal 
with new or modified drivers. 


MACINTOSH SOFTWARE 
All supplied Macintosh software is compatible with Sys- 
tem 6.07 and with System 7.0. 


IDT/console 

IDT/console is an application that runs on the Macintosh 
under MultiFinder, and provides a terminal emulation win- 
dow for the UNIX console, as well as I/O services to UNIX for 
file systems on Macintosh disks. 


MacTCP 
This Apple communication tool adds TCP/IP capability to 
the Macintosh. 


IDT/envY 

IDT/envY is a Macintosh extension that provides the 
UNIX and Macintosh sides two different IP addresses, so 
they can communicate with each other using TCP/IP proto- 
cols. This communication takes place over the NuBus, so it 
is extremely fast and reliable. No Ethernet card is required 
for the Macintosh-UNIX path. IDT/envY also permits a 
single Macintosh Ethernet card to serve both CPUs. 


Telnet | 

Telnet is a Macintosh application that opens a terminal 
window on the Macintosh through which new UNIX sessions 
can be started. Multiple Telnet windows can be open at 
once, each running different processes. 


EDIT 

Consulair EDIT is a Macintosh text editing program, well 
suited for writing C source code. The text files are easily 
passed to the Unix side for compilation and debug. 


NFS for the Macintosh 

NFS communications is included with the UNIX package, 
so files can be mounted from remote servers. Intercon’s 
NFS/share™ is available as an option on the Macintosh 
side. NFS/share allows UNIX file systems to be mounted 
and opened on the Mac like AppleShare volumes. 


X-Windows 

An X-11 Library and Motif client software is included for 
the UNIX side. Apple’s MacxX is available separately for the 
Macintosh side so that X-windows can be opened on the 
Macintosh screen. 


Major Components of UNIX file systems and approximate size in MBytes. 
Swap file (typically 40 MB) not shown 


/(1 $0) 


| 
/usr (174) 


letc (4) /dev (1) 


/lib (20) 


/windows (30) /RISCos (15) 


(Unix Libs) (Drivers, 


Reconfig files) 


(X11 libs) (Compatibility 


with older versions) 


/ete (12) 


(Commands) 


[bin (4) 


/bin (10) 


/man (9) /new (24) 


(Unix Cmnds) (BSD cmnds 


(Unix and 
BSD cmnds) 


(On-line 
Documentaton) 
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Integrated Device Technology, Inc. 


CUSTOM AND 
OFF-THE-SHELF 


RISC ASSEMBLIES 








INTRODUCTION 


IDT’s RISC Subsystems Division provides design and 
manufacturing services for subassemblies and complete PC 
boards using any of the company’s RISC CPUs, from the 
R3051 to the R4000. Several CPU Modules are available as 
standard off-the-shelf products for either prototype develop- 
ment, for initial software porting, or for usein volume production. 
Four of IDT’s standard R3000-based modules are shown on 
the following pages. 


FOR PROTOTYPE DEVELOPMENT 


For quick development of systems, use a prototyping 
platform. Thereis a completely assembled and tested prototype 
development platform available for each of IDT’s standard 
CPU modules. These platforms include the module, main 
memory, EPROM containing IDT/sim debug monitor, serial 
/O for control and downloading, and an expansion bus for 
adding additional hardware to the design. The prototyping 
platforms require only a 5V supply and a terminal to be 
operational. Code compiled on the software development 
system can be downloaded, executed, and debugged. 


Concept Review 


Detailed Design 


Product Spec. 
Design Review | 


Diagnostics 


Fab a-protos 


Debug and | 
Functional Verification. | 


; 


FOR PRODUCTION 


You can design your system around one of the standard 
CPU modules to save engineering time and simplify manufac- 
turing. The modules are carefully engineered to support the 
high speeds present around the CPU, and are completely 
tested and burned-in prior to shipment. To achieve high 
performance, the modules are typically built using all surface 
mount components on both sides of an 8 to 14 layer FR-4 PC 
board. Some modules include ASICs to add features and 
reduce size. High pin-count and fine-pitch (<20 mil) compo- 
nents are routinely assembled on our production lines. Sev- 
eral different interconnect systems are used to provide reli- 
able connection between the module and the mother board. 


CUSTOM MODIFICATION OF 
STANDARD MODULES 


lf one of our standard modules isn't exactly what you need, 
we can make modifications for you. The modifications can be 
as simple as changing the reset conditions or clock rate, or 
modifying the on-board state machine behavior. Or we can 
design a completely new module for to meet your exact 
specifications. In either case, we will supply you with test 
hardware and diagnostic software so we can work together to 
speed your program along. 


Design Review Il, 
Mfg Review | 


Fab B-protos 


Test Software, 


Test Fixtures 
Mfg Tooling 





Mfg Review Il 
Pilot Production 
Qualification Testing 


Volume Production 
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ENGINEERING AND MANUFACTURING 
SERVICES 


Use our Engineering Experience 

IDT can design and manufacture board-level products to 
meet your needs. We have designed and put into production 
more MIPS-based products than anyone else in the world, 
ranging from 40 MHz CPU modules to high-volume low-cost 
embedded controllers based on the 3051. 

When you hire IDT to design your board, you take advantage 
not only of an experienced design team with a fully equipped 
hardware lab and support staff, but also IDT’s extensive 
software porting and debug experience. Together with your 
engineers, we will define the optimum architecture, review the 
detailed design, develop diagnostic tools, and debug the 
prototype hardware and software. The same team of software 
engineers that developed IDT's powerful software tools will 
help get your software running on your own custom hardware. 


Production Assembly and Test 

Once your design has been completed, IDT will create a 
complete manufacturing package, and put the product into 
production in our own PC board assembly plant. | 

Initial units from the line will be qualified for reliability, 
manufacturability, and conformance to the product specifi- 
cation. On-going production can then commence with deliv- 
eries according to your needs. IDT can manufacture quanti- 
ties as small as a few hundred per year, or as oe as 
hundreds per week. 

Using IDT’s manufacturing resources offers the advan- 
tages of experienced manufacturing and test personnel, ready 
access to the development engineers for problem-solving, 
and close co-ordination with the semiconductor divisions of 
the company to insure timely delivery of components. 
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R3000 CPU MODULE IDT7RS109 





FEATURES | R3000 MODULE FOR HIGH PERFORMANCE 


* 64KB each of Instruction and Data Cache CPUS AND MULTIPROCESSOR SYSTEMS 
* High Speed: 33MHz The IDT7RS109 is a complete reduced instruction set 
* Includes R3010 Floating Point Accelerator computer (RISC) CPU, based on the MIPS R3000 RISC 
- 1-word Read Buffer; 4-word Write Buffer processor, and supplied on a small fully-tested high-density 
* Supports R3000 Multiprocessor Features plug-in module. The module includes the R3000 CPU, the 
* On-Board Parity Check and Generate R3010 Floating Point Accelerator, 64kB each of data and 
* Four or Eight-word block refills instruction cache memory, a single word read buffer and a 
* On-board oscillator, delay line, and reset four-word write buffer. Clock generation, reset, control, parity, 
circuitry and interrupt functions are included on the module to simplify 
* 100% burn-in and functional test at rated speed the remainder of the system design. 


The 109 module includes a latch to hold an external 
address for snooping in the D-cache and is designed to 
support the R3000’s multiprocessor features. | 

The module is constructed using surface mount devices on 
a 5.2" by 5.2" epoxy laminate board, and is connected to the 
user’s system via 195 pins located in two pin row regions on 
the board. ae 
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IDT7RS109 
R3000 RISC CPU MODULE 


PRODUCT BRIEF 





RELATED PRODUCTS 


Prototyping System | 


The 7RS109 module can be placed into immediate service 
using our flexible 7RS409 Prototyping Platform. The system 
includes .a 7RS109 module and_ two additional boards: a 
general purpose CPU board, and a personality card that 
interfaces the module to the CPU board. | 

The CPU board contains 1MB of main memory, 256kB of 
EPROM, two RS232 serial ports, an 8254 counter/timer, and 
an 8-bit parallel port accessible through a dual port RAM. Four 
50-pin connectors provide access to all the address, data, and 
control signals for external connection to additional narawae 
on, for oes a eae peal. | 


The system includes IDT’s Software Integration Manager,. 
which provides facilities for downloading code, exe mOlg 
memory, and stepping through programs. » 

The personality card is on a separate board, and provides 
a bed for the module, necessary control signals, and connec- 
tors for an HP16500 Logic Analyzer. | 

Code forthe R3000 canbe created onaMIPS davelbonent 
system, on IDT’s MacStation™ system, or using IDT’s PC- 
based cross assembler and compiler products. Assembled 
code can be downloaded into the Prototyping Syl for 
execution and debug. 





NOTE: 


The card on the left is the personality card with a module: the card on the right is the sone purpose CPU. 
Figure 1. A 7RS409 Prototyping Platform. 


ORDERING INFORMATION 


Contact your local IDT Sales Representative for Ordering 
Information. 
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R3000 CPU MODULE IDT7RS110 





Integrated Device Technology, Inc. 





FEATURES R3000 MODULE FOR GENERAL USE IN 


¢ High Speed: 33MHz SMALL SYSTEMS 

> Small size: 3.3" x 4.1" The IDT7RS110 is a complete reduced instruction set 
* 32KB each of Instruction and Data Cache computer (RISC) SubSystem, based on the MIPS R3000 
* Optional R3010 Floating Point Accelerator RISC processor, and supplied on a small fully-tested high- 
* Single-word Read and Write Buffers density plug-in module. The size and performance of the 
* On-Board Clock, Reset, and Parity Logic 7RS110 make it ideal for embedded aplications such as laser 
* 100% burn-in and functional test at rated speed printers and X-terminals. The 7RS110 includes an R3000 


Instruction Set CPU, 32kB each of instruction and data cache. 
a single word read buffer, a single word write buffer and the 
R3010 Floating Point Accelerator (optional). 

Cache misses are handled with single word memory ac- 
cesses or with eight or sixteen word block refills. 

The module is constructed using surface mount devices on 
a two-sided epoxy laminate board, and is connected to the 
user's system via six rows of 36 pins each. 








7RS110 Module. Actual Size 3.3" x 4.1" 
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IDT7RS110,0 2 
R3000 RISC CPU MODULE 


- PRODUCT BRIEF 





RELATED PRODUCTS 


Prototyping System 

_ The 7RS110 module can be placed into immediate service 
using our flexible 7RS410 Prototyping Platform. The system 
includes a33MHz 7RS110 module and two additional boards, 
a general purpose CPU board, and a personality pang that 
interfaces the module to the CPU board. 

The CPU board contains 1MB of main memory, 256K of 
EPROM, two RS232 serial ports, an 8254 counter/timer, and 
an 8-bit parallel port accessible through a dual port RAM. Four 
50-pin connectors provide access to all the:address, data, and 
control signals for external connection to additional nalawale 
on, for exampre ee EP pedey: ae 


The system clades IDT’s IDT/sim Susie Integration 
Manager, which provides facilities for downloading code, 
examining memory, and stepping through programs. 

The personality card is on a separate board, and-provides 
a bed for the module, necessary control signals, and connec- 
tors for an HP16500 Logic Analyzer. 

Code for the R3000 can be created ona MIPS savelonueel 
system, on IDT’s MacStation™ system, or usingIDT/c, IDT’s 
Multi-Host cross compiler product, available for a variety of 
machines. Assembled code can be downloaded into the 
Prototyping System for execution and debug. | 





NOTE: 


The card on the left is the personality card with a noida: the card on the right is the general purpose CPU. 


7RS410 Module sia Platform 


ORDERING INFORMATION 


Contact your local IDT Sales Representative for Ordering 


Information. 
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Integrated Device Technology, Inc. 





HIGH PERFORMANCE 
R3000 CPU MODULE 





ADVANCE 
INFORMATION 
IDT7RS114 





FEATURES 


* Cache Size: 64kB Instruction, 64kB Data 

¢ Processor Speeds up to 40MHz 

* Includes R3010 Floating Point Accelerator . 

¢ 8-word Write Buffer 

¢ Small Size 3.6" x 4.0" 

¢ Block refill option of 1, 4, 8, 16, or 32 words 

¢ On-board oscillator, delay line, and reset circuitry 
¢ 100% burn-in and functional test at rated speed 
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R3000 MODULE FOR HIGH PERFORMANCE 
CPUS AND MULTIPROCESSOR SYSTEMS 


The IDT7RS114 is a complete reduced instruction set 
computer (RISC) CPU, based on the MIPS R3000 RISC 
processor, and supplied on a small fully-tested high-density 
plug-in module. The module includes the R3000 CPU, the 
R3010 Floating Point Accelerator, 64kB each of data and 
instruction cache memory, an eight-word write buffer. Clock 
generation, reset, control, and interrupt functions are included 
on the module to simplify the remainder of the system design. 

The IDT7RS1 14 module includes a latch to hold an externa! 
address for snooping in the D-cache and is designed to 
support the R3000’s multiprocessor features. 

The module is constructed using surface mount devices on 
a 3.6" by 4.0" epoxy laminate board, and is connected to the 
user's system via a. 210-pin Augat connector. 





7RS114 (Actual Size is 3.6" x 4.0") 
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-7RS114 — a, 
R3000 RISC CPU MODULE 


PRODUCT BRIEF 





RELATED PRODUCTS 


Prototyping System 

The 7RS114 module can be placed into immediate service 
using our flexible 7RS414 Prototyping Platform. The system 
includes a 7RS114 module and a general purpose CPU board 
that interfaces the module to the CPU board. 

The CPU board contains 4MB of main memory, 256K of 
EPROM, DUART, SCSI interface, Centronics interface, an 
8254 counter/timer, and connectors for an HP16500 Logic 
Analyzer. Four 50-pin connectors provide access to all the 


7RS414 BLOCK DIAGRAM 


7RS114 
CONTROL 
PALS 
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| DATA. 
BUFFERS 
8-FCT804 


ORDERING INFORMATION 


7RS114 AND 7RS414 


Contact your local IDT Sales Representative for ordering 
information on the 7RS114 and 7RS414. 


_ address, data, and control signals for external connection to 


additional hardware on, for example, a wire-wrap board. 

The system includes IDT’s IDT/sim System Integration 
Manager, which provides facilities for downloading code, 
examining memory, and stepping through programs. 

Code for the R3000 can be created ona MIPS development 
system, on IDT’s MacStation™ system, or using IDT/c, IDT's 
Multi-Host cross compiler product, available for a variety of 
machines. Assembled code can be downloaded into the 
Prototyping System for execution and debug. — 


~ EPROM 
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ALABAMA 


IDT 

555 Sparkman Dr., 
Ste.1200-D 
Huntsville, AL 35816 
(205) 721-0211 


ALASKA 


Thorson Co. Northwest 
Bellevue, WA 
(206) 455-9180 


ARIZONA 


Western High Tech Mktg. 
Scottsdale, Az 
(602) 860-2702 


ARKANSAS 


IDT 

(S. Central Regional 
Office) 

14285 Midway Rd., Ste. 
100 

Dallas, TX 75244 
(214) 490-6167 


CALIFORNIA 


IDT 

(Corporate Headquarters) 
2975 Stender Way 

P.O. Box 58015 — 

Santa Clara, CA 95052 
(408) 727-6116 


IDT 

(Western Headquarters) 
2975 Stender Way 
Santa Clara, CA 95052 
(408) 492-8350 


IDT 

(SW Regional Office) 
6 Jenner Dr., Ste. 100 
Irvine, CA 92718 
(714) 727-4438 


IDT 

(SW Regional Office) 
16130 Ventura Bivd., 
Ste. 370 
Encino, CA 91436 
(818) 981-4438 


Quest-Rep 
San Diego, CA 
(619) 622-5040 


CANADA 
(EASTERN) 


CMT Renmark, Inc. 
Kanata, ONT 
(613) 591-9555 


CMT Renmark, Inc. 
Mississauga, ONT 
(416) 612-0900 


CMT Renmark, Inc. 
Pointe Claire, Quebec 
(514) 694-6088 


DOMESTIC SALES REPRESENTATIVES 


CANADA 
(WESTERN) 


Thorson Co. Northwest 
Bellevue, WA 
(206) 455-9180 


COLORADO 


IDT 

(NW Regional Office) 
1616 17th St, Ste. 370 
Denver, CO 80202 
(303) 628-5494 


Thorson Rocky Mountain 
Englewood, CO 
(303) 773-6300 


CONNECTICUT 


SJ Associates 
Rockville Centre, NY 
(516) 536-4242 


DELAWARE 


IDT 

(NE Regional Office) 
Horn Point Harbor 
105 Eastern Ave., Ste. 
201 

Annapolis, MD 21403 
(301) 858-5423 


S-J Mid Atlantic, Inc. 
Mt. Laurel, NJ 08054 
(609) 866-1234 


FLORIDA 


IDT 

(SE Regional Office) 
1413 S. Patrick Dr., Ste. 
10 

Indian Harbor Beach, FL 
32937 

(407) 773-3412 


IDT 

(SE Regional Office) 
18167 U.S. 19 North 
Ste. 455 

Clearwater, FL 34624 
(813) 532-9988 


IDT 

(SE Regional Office) 
1500 N.W. 49th St, 

Ste. 500 

Ft. Lauderdale, FL 33309 
(305) 776-5431 


GEORGIA 


IDT 

(SE Regional Office) 
18167 U.S. 19 North 
Ste. 455 

Clearwater, FL 34624 
(813) 532-9988 


HAWAII 


IDT 

(Western Headquarters) 
2975 Stender Way 
Santa Clara, CA 95052 
(408) 492-8350 


IDAHO 
(NORTHERN) 


Anderson Associates 
Bountiful, UT 


- (801) 292-8991 


IDAHO 
(SOUTHERN) 


Thorson Rocky Mountain 
Salt Lake City, UT 
(801) 942-1683 


ILLINOIS 


IDT 

(Central Headquarters) 
1375 E. Woodfield Rd., 
Ste. 380 

Schaumburg, IL 60173 
(708) 517-1262 


Synmark Sales 
Park Ridge, IL 
(708) 390-9696 


INDIANA 


Arete Sales 
Ft. Wayne, IN 
(219) 423-1478 


Arete Sales 
Greenwood, IN 
(317) 882-4407 


IOWA 


Rep Associates 
Cedar Rapids, IA 
(319) 373-0152 


KANSAS 


Rush & West Associates 
Olathe, KS 
(913) 764-2700 


KENTUCKY 


Arete Sales 
Ft. Wayne, IN 
(219) 423-1478 


LOUISIANA 


IDT 

(S. Central Regional 
Office) 

14285 Midway Rd., Ste. 
100 

Dallas, TX 75244 
(214) 490-6167 


MAINE 


IDT 

(Eastern Headquarters) 
#2 Westboro Business 
Park 

200 Friberg Pkwy., 

Ste. 4002 

Westboro, MA 01581 
(508) 898-9266 


MARYLAND 


IDT 

(NE Regional Office) 

Horn Point Harbor 

105 Eastern Ave., Ste. 201 
Annapolis, MD 21403 
(301) 858-5423 


MASSACHUSETTS 


IDT 

(Eastern Headquarters) 
#2 Westboro Business 
Park 

200 Friberg Pkwy., 

Ste. 4002 

Westboro, MA 01581 
(508) 898-9266 


MICHIGAN 


Tritech Sales 
Farmington Hills, MI 
(313) 442-1200 


MINNESOTA 


IDT 

(N. Central Regional Office) 
1650 W. 82nd Street 

Ste. 1040 

Minneapolis, MN 55431 
(612) 885-5777 


OHMS Technology Inc. . 
Edina, MN 
(612) 932-2920 


MISSISSIPPI 


IDT 

(SE Regional Office) 
1413 S. Patrick Dr., 

Ste. 10 

Indian Harbor Beach, FL . 
32937 

(407) 773-3412 


- MISSOURI 


Rush & West Associates 
St. Louis, MO 
(314) 965-3322 


MONTANA 


Thorson Rocky Mountain 
Englewood, CO 
(303) 773-6300 


NEBRASKA 


IDT 

(Central Headquarters) 
1375 E. Woodfield Rd., 
Ste. 380 

Schaumburg, IL 60173 
(708) 517-1262 


NEVADA 
(NORTHERN) 


IDT 

(Western Headquarters) 
2975 Stender Way 
Santa Clara, CA 95052 
(408) 492-8350 


NEVADA 
(SOUTHERN) 


Western High Tech Mktg. 
(Clark County, NV) 
Scottsdale, AZ 

(602) 860-2702 


NEW HAMPSHIRE 


IDT 

(Eastern Headquarters) 
#2 Westboro Business 
Park 

200 Friberg Pkwy., 

Ste. 4002 

Westboro, MA 01581 
(508) 898-9266 


NEW JERSEY 


IDT 

(NE Regional Office) 
One Greentree Centre, 
Ste. 202 

Marlton, NJ 08053 
(609) 596-8668 


SJ Mid-Atlantic, Inc. 
Mt. Laurel, NJ 
(609) 866-1234 


NEW MEXICO 


Western High Tech Mktg. 
Scottsdale, Az 
(505) 884-2256 


NEW YORK 


IDT 

(NE Regional Office) 
250 Mill St, Ste.107 
Rochester, NY 14614 
(716) 777-4040 


Quality Components 
Buffalo, NY 
(716) 837-5430 


Quality Components 
Manlius, NY 
(315) 682-8885 


SJ Associates 
Rockville Centre, NY 
(516) 536-4242 


NORTH CAROLINA ~ SOUTH CAROLINA 
Tingen Technical Sales IDT IDT .”” Anderson Associates Thorson Co. Northwest ~ . 
Raleigh, NC (NW RegionalOffice) (SE Regional Office) - Bountiful, UT =. Bellevue, WA . 
(919) 870-6670 15455 NW Greenbriar 1413 S. Patrick Dr, Ste. 10 (801) 292-8991 (206) 455-9180 
os Pkwy : “Indian Harbor Beach, FL 7 mo, 
NORTH DAKOTA Ste. 210 32937 Thorson Rocky Mountain WEST VIRGINIA. 
Beaverton, OR 97006 » (407) 773-3412 Salt Lake City, UT 84121 . 
Coe nn moogy ine. . (603) 690-8978 (801) 942-1683 Norm Case Associates 
OHIO -(WESTERN) sina, MN IT | 
Rocky River OH 3 Rocky River, OH #2 Westboro Business. = Synmark Sales 
(216) 339-0 he (216) 333-0400 TENNESSEE Park — Park Ridge, IL 
| - IDT | ea pee Pkwy., . (708) 390-9696 
CG 1 e. 
(S. Central Regional Office) S-J Mid-Atlantic felipbiieatee oo Thorson Rocky Mountain 
14285 Midway Rd., Ste. Mt. Laurel, NJ 08054 a VIRGINIA Englewood, CO- 
100 (609) 866-1234 TEXAS ior (303) 773-6300 
Dallas, TX 75244 (NE Regional Office) 
(214) 490-6167 RHODE. ISLAND IDT . . Horn Polat Harbor 
IDT ( penta seat Office) 405 Eastern Ave., Ste. 201 
(Eastem Headquarters) 100 EY Rte: pl peer ee 1403 
gaa aang Dallas, TX 75244 
200 Friberg Pkwy. (ETE A000 87 
Ste. 4002 
Westboro, MA 01581 
_ (508) 898-9266 


OREGON 


UTAH 


WASHINGTON 





Integrated Device Technology, Inc. 


(Western Headquarters) 
2975 Stender Way 
Santa Clara, CA 95052 
(408) 492-8350 


Integrated Device Technology, Inc. 
(Southwestern Regional Office) 


6 Jenner Drive, Suite 100 
Irvine, CA 92718 
(714) 727-4438 — 


-IDT TECHNICAL CENTERS 


Integrated Device Technology, inc. 


(South Central Regional Office) - 
14285 Midway Road, Suite 100 
Dallas, TX 75244 

(214) 490-6167 


Integrated Device Technology, Inc. 
(Eastern Headquarters) 


‘#2 Westboro Business Park 
200 Friberg Parkway, Suite 4002 
Westboro, MA 01581 
(508) 898-9266 


Integrated Device Technology, Ltd. . 
(European Headquarters/Northern Europe : 


Regional Office) 

21 The Crescent 
Leatherhead 

Surrey, UK KT228DY 
Tel.: 44-0372-363-339/734 





AUTHORIZED DISTRIBUTORS (U.S. and Canada) 


Alliance 


Contact your local office. 


Future 
Electronics 


~ Hall-Mark 


Hamilton/Avnet 


Vantage 
_-. Components 


Insight 


Zentronics 
Electronics ay 


AFRICA 


Monte Vista International 
5673 W. Los Positas Blvd., 
Ste. 205 

Pleasanton, CA 94588 
Tel: 510-463-8693 


AUSTRALIA 


George Brown Group 
Rydalmere, Australia 
Tel.: 612-638-1999 


George Brown Group 
Hilton, Australia 
Tel.: 618-352-2222 


George Brown Group 
Blackburn, Australia 
Tel.: 613-878-8111 


AUSTRIA 


Elbatex AG 
Hardstrasse 72 
CH-5430 Wettingen 
Switzerland 

Tel.: 011-41-56275-777 


BELGIUM 


Betea S.A. 
St.-Stevens-Woluwe, 
Belgium 

Tel.: 322-725-1080 


DENMARK 


Exatec A/S 
Copenhagen, Denmark 
Tel.: 45-31-191022 


FINLAND 


Comodo Oy 
Helsinki, Finland 
Tel.: 358-0757-2266 


FRANCE 


IDT 

(So. Europe Reg. Office) 
15 Rue du Buisson aux 
Fraises 

91300 Massy, France 
Tel.: 33-1-69-30-89-00 


Scientec REA 
Bordeaux, France 
Tel.: 33-56-39-3271 


Scientec REA 
Chatillon, France 
Tel.: 33-149-652750 


Scientec REA 
Cesson-Sevigne, France 
Tel.: 33-99-83-9898 


Scientec REA 
Rognes, France 
Tel.: 33-42-50-1805 


INTERNATIONAL SALES REPRESENTATIVES 


Scientec REA 
Schwerwiller, France 
Tel.: 33-88-82-5514 


Scientec, REA 
Saint-Etienne, France 
Tel.: 33-77-79-7970 


A2M 
Brignolles, France 
Tel.: 33-1-94-59-2293 


A2M 
Bron, France 
Tel.: 33-1-72-37-0414 


A2M 
BUC, France 
Tel.: 33-1-39-56-8181 


A2M 
Cesson-Sevigne, France 
Tel.: 33-1-99-63-3232 


A2M 
Le Chesnay Cedex, France 
Tel.: 33-1-39-54-9113 


A2M 
Merignac, France 
Tel.: 33-1-56-34-1097 


Aquitech 
Merignac, France 
Tel.: 33-56-55-1830 


Aquitech 
Cedex, France 
Tel.: 33-1-4-96-9494 


Aquitech 
Rennes, France 
Tel.: 33-99-78-3132 


Aquitech 
Lyon, France 
Tel.: 33-72-73-2412 


GERMANY 


IDT 

(Central Europe Reg. 
Office) 
Gottfried-Von-Cramm-Str. 1 
8056 Neufahrn, Germany 
Tel.: 49-8165-5024 


Jermyn GmbH 
Limburg, Germany 
Tel.: 49-6431/508-0 


Jermyn GmbH 
Berlin, Germany 
Tel.: 49-30/2142056 


Jermyn GmbH 
Dusseldorf, Germany 
Tel.: 49-21 1/25001-0 


Jermyn GmbH 
Heimstetten, Germany 
49-89/909903-0 


Jermyn GmbH 
Herrenberg, Germany 
Tel.: 49-7032/203-01 


Jermyn GmbH 
Norderstedt, Germany 
Tel.: 49-40/5282041 


Jermyn GmbH 
Nurnberg, Germany 
Tel.: 49-91 1/425095 


Scantec GmbH 
Planegg, Germany 
Tel.: 49-859-8021 


Scantec GmbH 
Kirchheim, Germany 
Tel.: 49-70-215-4027 


Scantec GmbH 
Ruckersdorf, Germany 
Tel.: 49-91-157-9529 


Topas Electronic GmbH 
Hannover, Germany 
Tel.: 49-51-113-1217 


Topas Electronic GmbH 
Quickborn, Germany 
Tel.: 49-4106-73097 


HONG KONG 


IDT 

(Hong Kong Reg. Office) 
Aim. 1505, 

15/F The Centre Mark, 
287-299 Queen's Road 
Central 

Hong Kong 

Tel.: 852-542-0067 


Lestina International Ltd. 
Kowloon, Hong Kong 
Tel.: 852-735-1736 


INDIA 


Malhar Corp. 
Bryn Mawr, PA 
Tel.: 215-527-5020 


Sritech Information 
Technology, Inc 
Javanagar, Bangalore 
0812-643608 


ISRAEL 


Vectronics, Ltd. 
Herzlia, Israel 
Tel.: 972-52-556070 


ITALY 


Lasi Electronica 
Bologna, Italy 
Tel.: (3951) 353815 


Lasi Electronica 
Firenze, Italy 
Tel.: (3955) 582627 


Lasi Electronica 
Milano, Italy 
Tel.: (39) 266-101370 


Lasi Electronica 
Roma, Italy 
Tel.: (19396) 5405301 


Lasi Electronica 
Torino, Italy 
Tel.: (3911) 328588 


Microelit SPA & SRL 
Milan, Italy 
Tel.: 39-2-4817900 


Microelit SPA & SRL 
Rome, Italy 
Tel.: 39-6-8894323 


JAPAN 


IDT 

(Japan Headquarters) 
U.S. Bldg. 201 

1-6-15 Hirakarasho, 
Chiyoda-Ku 

Tokyo 102, Japan 
Tel.: 813-3221-9821 


Dia Semicon Systems 
Tokyo, Japan 
Tel.: 813-3439-2700 


Kanematsu Semiconductor 
Corp. 

Tokyo, Japan 

Tel.: 813-3551-7791 


Marubun 
Tokyo, Japan 
Tel.: 813-3639-9805 


Tachibana Tectron Co., Ltd. 


Tokyo, Japan 
Tel.: 813-3793-1171 


KOREA 


Eastern Electronics 
Seoul, Korea 
Tel.: 822-553-2997 


NETHERLANDS 


Auriema 
Eindhoven, Netherlands 
Tel.: 31-40-816565 


NORWAY 


Eltron A/S 
Oslo, Norway 
Tel.: 47-2-500650 


SINGAPORE 


IDT 

(Hong Kong Reg. Office) 
Fim. 1505, 

15/F The Centre Mark, 
287-299 Queen's Road 
Central 

Hong Kong 

Tel.: 852-542-0067 


SOUTH AMERICA 


Intectra Inc. 
Mountain View, CA 
Tel.: 415-967-8818 


SPAIN 


Anatronic, S.A. 
Madrid, Spain 
Tel.: 34-1-542-5566 


Anatronic, S.A. 
Barcelona, Spain 
Tel.: 34-3-258-1906 


SWEDEN 


Svensk Teleindustri AB 
Spanga, Sweden 
Tel.: 46-8-761-7300 


SWITZERLAND 


Elbatex AG 
Hardstrasse 72 
CH-5430 Wettingen 
Switzerland 

Tel.: 011-41-56275-777 


TAIWAN 


Johnson Trading Company 
Taipei, Taiwan 
Tel.: 886-273-3121 1 


World Peace Industrial Co., 
Ltd. 

Taipei, Taiwan 

Tel: 886-2788-5200 


UTC 
Taipei, Taiwan 
Tel.: 886-2-7753666 


UNITED KINGDOM 


IDT 

(European Headquarters/ 
No. Europe Fieg. Office) 
21 The Crescent 
Leatherhead 

Surrey, UK KT228DY 
Tel.: 44-0372-363-339/734 


Micro Call, Ltd. 
Thame Oxon, UK 
Tel.: 44-844-261-939 


The Access Group Ltd. 
Hertfordshire, UK 
Tel.: 0462-480888 
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